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Cross-Reference to Related Applications 
This application claims the benefit of U.S. provisional patent applications 
60/426,132, filed November 14, 2002, 60/485,641, filed July 8, 2003, and 
10 60/487,899, filed July 17, 2003. 

Statement as to Federally Sponsored Research 
The present research was supported by a grant from the National Institutes 
of Health-Nationalinstitute of General Medical Sciences (NIH-NIGMS; grant 
15 number GM52981). The U.S. government has certain rights to this invention. 

Background of the Invention 
The invention relates to compounds (e.g., peptidomimetics and non- 
peptides) that inhibit a cellular proliferative disorder and methods of treating such 
20 disorders. The invention also provides three-dimensional structures of a Polo-like 
kinase and methods for designing or selecting small molecule inhibitors using 
these structures. Desirably, these compounds have certain structural, physical, and 
spatial characteristics that enable the compounds to interact with specific amino 
acid residues. 

25 Cyclin-dependent kinases (Cdks) have long been considered the master 

regulators of the cell-cycle, but an increasing number of diverse protein kinases 
are now emerging as critical components of cell-cycle progression. Among these 
are members of the Polo-like kinase family (Plks) that play key roles during all 
stages of mitosis and in the cell cycle checkpoint response to genotoxic stress. 

30 Many protein kinases involved in cell-cycle control function, in part, by 
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generating phosphoserine/threonine-containing sequence motifs in their substrates 
that are subsequently recognized by phosphoserine/threonine-binding proteins. 
These include the WW and proline isomerase domain of Pin 1 that regulates 
mitotic progression, 14-3-3 proteins that control the G2/M transition in response to 
5 DNA damage, and the WD40 repeat of Cdc4p which regulates S-phase entry. 

In several instances, a phosphopeptide-binding domain and a kinase domain 
are combined within a single molecule, best exemplified by the SH2 domain- 
containing Src kinases and the Rad53p/Chk2-family of FHA domain-containing 
kinases. In these proteins the phosphopeptide-binding domain targets the kinase to 

10 pre-phosphorylated (primed) sites, mediates processive phosphorylation at 

multiple sites within a single substrate, or facilitates kinase activation. Polo-like 
kinases are distinguished by the presence of a conserved Ser/Thr kinase domain 
and a non-catalytic C-terminal region composed of two homologous -70-80 
residue segments termed Polo-boxes. 

15 Humans, mice and frogs each have three Plk homologues denoted Plkl, 

Plk2/Snk, and Plk3/Fnk/Prk, while budding yeast, fission yeast, and flies contain 
only a single Plk family member denoted Cdc5p, Plol, and Polo, respectively. In 
addition, humans and mice have a serine/threonine kinase, Sak, that is an 
extremely divergent member of the Plk family, containing only a single Polo-box 

20 and lacking a canonical PBD. 

The most extensively studied Polo-like kinases, Plkl and Cdc5p, have been 
implicated in numerous mitotic processes including activation of Cdc25C and 
Cdc2-cyclinB at the G2-M transition, centrosome maturation and spindle 
assembly, cohesin release/cleavage during sister chromatid separation, anaphase 

25 promoting complex (APC) activation during mitotic exit, and septin regulation 
during cytokinesis. In contrast human Plk2 and Plk3 appear to serve different 
functions. Plk2 shows peak expression and activity in early Gl, while Plk3 is 
activated by several stress response pathways, including DNA damage and spindle 
disruption. In fact, Plk3 plays some roles that may directly antagonize Plkl 
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function. For example, DNA damage directly inhibits Plkl, but activates Plk3 in 
an Ataxia-Telangiectasia-Mutated (ATM)-dependent manner. Consistent with 
these results, Plkl overexpression causes oncogenic transformation in NIH 3T3 
cells, while overexpression of Plk3 induces apoptosis. 

5 

Summary of the Invention 
We have developed a proteomic approach for identifying targets 
downstream of kinases in signaling pathways. Our strategy involves using an 
immobilized library of partially degenerate phosphopeptides, biased toward a 

10 kinase phosphorylation motif, to isolate interacting effector proteins targeted by 
substrates of that kinase. Utilizing this approach for cyclin-dependent kinases, we 
discovered that the carboxy-terminal region of the cell cycle regulating kinase, 
Plk-1, encodes a phosphopeptide recognition domain that consists of the non- 
kinase region of this protein (amino acids 326-603). This phosphopeptide 

15 recognition domain, termed the Polo-box domain (PBD), binds phosphoserine and 
phosphothreonine residues in a sequence-specific context. Specifically, this PBD 
recognizes and binds to the core phosphopeptide sequence serine-phosphoserine or 
serine-phosphothreonine. 

We performed oriented peptide library screening on the PBDs from all 

20 three human Plk homologues, as well as on the Plkl orthologues Plxl from 

Xenopus and Cdc5p from budding yeast. Despite differences in cellular function, 
we found that all PBDs show strong conserved selection for the core sequence S- 
[pSer/pThr]-P/X. 

To determine the structural basis of PBD activity, the crystal structure of 
25 the human Plkl PBD in complex with its optimal phosphothreonine-containing 
peptide was determined. We identified a mode of phosphopeptide binding that is 
unique among structurally characterized phosphodependent binding 
protein/modules and that is crucial for PBD targeting to substrates both in vitro 
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and in vivo. The architecture of the Plkl PBD differs significantly from that 
recently observed for homodimers of the single Polo-box from murine Sak, which 
lacks a formal PBD (Leung et al., Nat. Struct. Biol. 9:719-724, 2002). The Plkl 
PBD represents a new protein fold. Site-directed mutagenesis based on the 
5 structural identification of critical phosphothreonine-binding residues has enabled 
us to demonstrate that phosphodependent substrate recognition by the PBD is 
necessary for proper mitotic progression. Furthermore, binding of the optimal 
Plkl phosphopeptide to the PBD in full-length Plkl enhances the in vitro activity 
of the kinase domain, leading to a model for Plk regulation in which 

10 intramolecular inhibition of the kinase by the PBD is relieved by PBD-ligand 
binding. We conclude that phosphoserine/threonine-dependent binding is a 
general feature of PBD activity across the Plk family and critically important for 
the function of this domain in Polo-like kinase targeting and regulation. These 
studies have identified sites that may be targeted in designing therapeutics useful 

15 in treating diseases or disorders characterized by inappropriate cell cycle 
regulation or inappropriate cell death. 

We applied the same proteomic approach to identify phosphopeptide- 
binding modules mediating signal transduction events in the DNA damage 
response pathway. Using a library of partially degenerate phosphopeptides biased 

20 to resemble the phosphorylation motif of the phosphoinositide-like kinases ATM 
and ATR, we identified tandem BRCT domains in PTIP and BRCA1 as 
phosphoserine (pSer)- or phosphothreonone (pThr)-specific binding modules that 
recognize a subset of ATM (ataxia telangiectasia-mutated) and ATR (ataxia 
telangiectasia- and RAD3 -related) -phosphorylated substrates following y- 

25 irradiation. PTIP tandem BRCT domains are responsible for phosphorylation- 
dependent protein localization into 53BPl-and phospho-H2AX (_-H2AX)- 
containing nuclear foci, a marker of DNA damage.These findings provide a new 
molecular rationale for BRCT domain function in the signaling response to DNA 
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damage and may help to explain why the BRCA1 BRCT domain mutation 
Met 1775 3 Arg, which fails to bind phosphopeptides, predisposes 
women to breast and ovarian cancer.. 

In one aspect, the invention generally features computer containing a 
5 processor in communication with a memory; the memory having stored therein (i) 
at least one atomic coordinate, or surrogates thereof, from Table 5 for each of the 
following residues: His-538, Lys-540, Trp-414, or Leu-491 of a Polo-box domain 
or atomic coordinates that have a root mean square deviation of the coordinates of 
less than 3 A; and (ii) a program for generating a three-dimensional model of the 

10 coordinates. In one embodiment, the coordinate is for a heteroatom. In another 
embodiment, the coordinate is for a side-chain atom. In another embodiment, the 
coordinate is for a side-chain and a heteroatom. 

In another aspect, the invention generally features a computer containing a 
processor in electrical communication with a memory; the memory having stored 

1 5 therein (i) atomic coordinates, or surrogates thereof, as shown in Table 5 for atoms 
of residues His-538, Lys-540, Trp-414, or Leu-491 of a Plkl Polo-box domain or 
atomic coordinates that have a root mean square deviation from the cooridinates of 
the residues of less than 1, 2, 3, 4, or 5 A; and (ii) a program for displaying a 
three-dimensional model of the Polo-box domain. 

20 In another aspect, the invention provides a computer containing a processor 

in communication with a memory; the memory having stored therein (i) x-ray 
diffraction data for at least one of the non-hydrogen atoms of residues His-538, 
Lys-540, Trp-414, or Leu-491 of a Polo-box domain or x-ray diffraction data for 
amino acids that have a root mean square deviation from the backbone atoms of 

25 the residues of less than 1, 2, 3, 4, or 5 A; and (ii) a program for generating a 
three-dimensional model of the Polo-box domain. 

In another aspect, the invention provides a computer containing a processor 
in communication with a memory; the memory having stored therein a 
pharmacophore model of a phosphopeptide that binds a Polo-box domain and a 
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program for displaying the model, the model containing at least one of the 
following: a phosphate group on threonine that participates in at least 1 hydrogen- 
bonding interaction; and a serine at the pThr-1 position, where the Ser-1 side chain 
is directed towards the Plkl surface. In one embodiment, the serine engages in at 
5 least two of the following (i) a hydrogen bonding interaction with Trp-414 main- 
chain atoms of PBD; (ii) a hydrogen bonding interaction with Leu-491 main-chain 
carbonyl of PBD; and (iii) a van der Waals interaction with C81 from the Trp-414 
indole side chain of PBD. In one embodiment, the model further comprises a 
Proline at the pThr+1 position, where the proline introduces a kink that allows a 

10 pThr+2 main chain amino group to contact PBD. 

In another aspect, the invention provides a method of selecting or designing 
a candidate ligand for a Polo-box domain, the method involves the steps of: (a) 
generating a three-dimensional structure of a Polo-box domain having at least one 
atomic coordinate, or surrogate thereof, from Table 5 for each of the following 

15 residues: His-538, Lys-540, Trp-414, or Leu-491 or atomic coordinates that have a 
root mean square deviation from the coordinates of less than 1, 2, 3, 4, or 5 A; and 
(b) selecting or designing a candidate ligand having sufficient surface 
complementary to the structure to bind a Polo-box domain in an aqueous solution. 
In another aspect, the invention provides a method for manufacturing a Polo-box 

20 domain ligand, the method involves the steps of: (a) obtaining the atomic 
coordinates of at least one residue of a Polo-box domain with a ligand; (b) 
determining one or more moieties in the ligand to be modified; where the modified 
ligand maintains the ability to bind the Polo-box domain; and (c) modifying the 
ligand based on the determination. In one embodiment, the method further 

25 involves crystallizing a Polo-box domain with a ligand. In another embodiment, 
the ligand specifically binds the Polo-box domain. In another embodiment, the 
modification increases the affinity of the ligand for the Polo-box domain. In 
another embodiment, the modification increases the solubility of the ligand. In 
another embodiment, the modification increases the half-life of the ligand in vivo. 
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In another aspect, the invention provides a method for manufacturing a 
Polo-box domain ligand, the method involves manufacturing a ligand that binds a 
Polo-box domain; where the ligand is designed or selected based on information 
obtained using a model of the atomic coordinates of at least a portion of the Polo- 
5 box domain. 

In another aspect, the invention provides a method of evaluating the ability 
of a candidate ligand to bind a Polo-box domain, the method involves the steps of: 

(a) generating a three-dimensional structure of a Polo-box domain having at least 
one atomic coordinate, or surrogate thereof, from Table 5 for each of the following 

10 residues: His-538, Lys-540, Trp-414, or Leu-491 or atomic coordinates that have a 
root mean square deviation from the coordinates of less than 1, 2, 3, 4, or 5 A; and 

(b) employing a means to measure the interaction between the candidate ligand 
and the Polo-box domain. 

In another aspect, the invention provides a method of identifying a 
15 candidate ligand for a Polo-box domain, the method involves the steps of: (a) 

generating a three-dimensional pharmacophore model of Polo-box domain ligands 
using a computer of a previous aspect; and (b) selecting a candidate ligand 
satisfying the criteria of the pharmacophore model. In various embodiments, of 
any previous aspect, the method further involves determining the ability of the 
20 candidate ligand to bind the Polo-box domain in vitro or in vivo. In other 

embodiments, the method further involves determining the ability of the candidate 
ligand to alter the enzymatic activity of the Polo-box domain in vitro or in vivo. In 
other embodiments, the three-dimensional structure further comprises the 
hydrogen atoms of residues His-538, Lys-540, Trp-414, or Leu-491. 
25 In various embodiments of the above aspects, the coordinate is for a 

heteroatom, or a side-chain atom, or a side-chain and a heteroatom. In other 
embodiments, the memory stores at least 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 coordinates 
or surrogates thereof for His-538; at least 1, 2, 3, 4, 5, 6, 7, 8, or 9 coordinates or 
surrogates thereof for Lys-540, at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, or 14 
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coordinates or surrogates thereof for Trp-414; or at least 1 5 1, 2, 3, 4, 5, 6, 7, or 8 
coordinates or surrogates thereof for Leu-491 . In other embodiments, the 
coordinate is any one or all of the atomic coordinates in Table 5. In other 
embodiments of the previous aspect, the coordinates are for any residue required 
5 for the biological activity of a Polo box domain, or for binding a phosphopeptide 
or peptide mimetic. In other embodiments of any of the above aspects, root mean 
square deviation of the coordinates of less than 1, 2, 3, 4, 5, 6, or 7 A. 

In another aspect, the invention features a crystal of a Polo-like kinase 
complex containing a Polo-box domain bound to a phosphopeptide. In one 

10 embodiment, the the Polo-like kinase is Plk-1 . In another embodiment, the Plk-1 
comprises at least amino acids 1-603 of SEQ ID NO: 1. In another embodiment, 
the Plk-1 comprises at least amino acids 95-603. In another embodiment, the Plk- 
1 comprises at least amino acids 326-603. In another embodiment, the Plk-1 
comprises at least amino acids 367-603. In another embodiment, the 

15 phosphopeptide comprises the amino acid sequence [Pro/Phe]-[(j)/Pro]- 
[())/Alacdc5p/Glnp, k2 ]-[Thr/Gln/His/Met]-Ser-[pThr/pSer]-[Pro/X], where ()) 
represents hydrophobic amino acids. In another embodiment, the phosphopeptide 
comprises the amino acid sequence MAGPMQ-S-pT-P-LNGAKK. In another 
embodiment, the Polo-like kinase is Plk-2. In another embodiment, the Polo-like 

20 kinase is Plk-3 

In another aspect, the invention provides a method of obtaining a structural 
model of a Polo-box domain of interest, the method involves homology modeling 
using at least a portion of the atomic coordinates in Table 5 and at least a portion 
of the amino acid sequence of the Polo-box domain of interest, thereby generating 

25 a model of the Polo-box domain of interest. 

In another aspect, the invention provides a method of determining the three- 
dimensional structure of a Polo-box domain/phosphopeptide complex of interest, 
the method involves the steps of: (a) crystallizing the Polo-box 
domain/phosphopeptide complex of interest; (b) generating an X-ray diffraction 
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pattern from the crystallized Polo-box domain of interest; and (c) applying at least 
a portion of the atomic coordinates in Table 5 to the diffraction pattern to generate 
a three-dimensional electron density map of at least a portion of the Polo-box 
domain/phosphopeptide complex of interest. 
5 In another aspect, the invention features an isolated, less than full-length 

fragment of Polo-box domain containing residues 367-603 of human Plk-1 Polo- 
box domain) in complex with a phosphopeptide containing S-[pS/pT]-P/X, where 
X is any amino acid. 

In another aspect, the invention features an isolated, less than full-length 

10 fragment of Polo-box domain containing residues residues 500-685 of human Plk- 
2 Polo-box domain in complex with a phosphopeptide containing S-[pS/pT]-P/X, 
where X is any amino acid. 

In another aspect, the invention features an isolated, less than full-length 
fragment of Polo-box domain containing residues residues 421-607 of human Plk- 

15 3 Polo-box domain in complex with a phosphopeptide containing S-[pS/pT]-P/X, 
where X is any amino acid. 

In another aspect, the invention features an isolated Polo-box domain 
protein or fragment thereof containing a mutation, where the mutation is (a) a 
mutation that enhances the ability of Polo-box domain to crystallize; (b) a 

20 mutation of a residue that is otherwise post-translationally modified in an 

organism used for recombinant expression; (c) a mutation of the NH2- or COOH- 
terminal residue of Polo-box domain; (d) a mutation that increases or decreases the 
affinity of a Polo-box domain for a phosphopeptide; or (e) a mutation that alters 
the folding of Polo-box domain. In one embodiment, the PBD further comprises a 

25 mutation at His-538, Lys-540, Trp-414, or Leu-491. In other embodiments, the 
nucleic acid encodes a protein of any previous aspect. 

In another aspect, the invention features a phosphopeptide containing the 
amino acid sequence [Pro/Phe]-[())/Pro]-[(l)/Alacdc5p/Glnpi k 2]4Thr/Gln/His/Met]- 
Ser-[pThr/pSer]-[Pro/X], where (]) represents hydrophobic amino acids. In one 
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embodiment, the phosphopeptide comprises Pro-Met-Gln-Ser-pThr-Pro-Leu, 
where the phosphopeptide binds human Plk-1 . 

In another aspect, the invention features a phosphopeptide containing the 
amino acid sequence, 



Met 

Tyr 

Phe 

He 

Leu 

His 

Lys 



Ala 

His 

Met 

Thr 

Phe 

Gin 



Ser 
Ala 
Gly 
Thr 



pSer 
pThr 



P-3 P-2 P-1 PO , 
where pSer and pThr are phosphorylated serine and phosphorylated threonine, 
and where the amino acids designated in P-3, P-2, or PI may be natural or 
unnatural amino acids. In one embodiment, the phosphopeptide of the previous 
aspect further contains the amino acid sequence, 





Met 










Tyr 




Ala 








Phe 




His 




Ser 


X-,aa 


He 




Met 




Ala 


Leu 




Thr 




Gly 




His 




Phe 




Thr 




Lys 




Gin 






P-4 


P-3 


P-2 


P-1 



pSer 




Pro 


pThr 




Met 






Asn 


PO 


P+1 



X 2 aa 



P+2 



where Xiaa and X 2 aa are any amino acids and where pSer and pThr are 
phosphorylated serine and phosphorylated threonine. In another embodiment, the 
Xiaa is proline and where X 2 aa is any amino acid. In another embodiment, the 
Xiaa is any amino acid and where X 2 aa is alanine, leucine, valine, isoleucine, 
phenylalanine, tyrosine, and tryptophan. In another embodiment, the X 2 aa is 
leucine. In another embodiment, the amino acid at position P-3 is methionine. In 
another embodiment, the amino acid at position P-2 is glutamine. In another 
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embodiment, the amino acid at position P-l is serine. In another embodiment, the 
amino acid at position PO is phosphorylated serine. In another embodiment, the 
amino acid at position PO is phosphorylated threonine. In another embodiment, 
the amino acid at position P+l is proline. In another embodiment, the amino acid 
5 sequence is Met-Gln-Ser-pThr-Pro-Leu or Met-Gln-Ser-pSer-Pro-Leu, where Xiaa 
is any amino acid and pThr is phosphorylated threonine and pSer is 
phosphorylated serine. In another embodiment, the phosphopeptide does not 
exceed 25 amino acids residues. In another embodiment, the phosphopeptide does 
not exceed 1 5 amino acids residues. In another embodiment, the phosphopeptide 

10 does not exceed 10 amino acids residues. 

In another aspect, the invention features a pharmaceutical composition 
containing a therapeutic effective dose of any of the phosphopeptides of the 
previous aspects and a pharmaceutically acceptable excipient, where the 
pharmaceutical composition is useful for the treatment of a disorder characterized 

15 by inappropriate cell cycle regulation. In one embodiment, the cellular 

proliferative disorder is a neoplasm. In another embodiment, the composition 
further comprises a second chemotherapeutic agent; In another embodiment, the 
second chemotherapeutic agent is selected from the group consisting of paclitaxel, 
gemcitabine, doxorubicin, vinblastine, etoposide, 5-fluorouracil, carboplatin, 

20 altretamine, aminoglutethimide, amsacrine, anastrozole, azacitidine, bleomycin, 
busulfan, carmustine, chlorambucil, 2-chlorodeoxyadenosine, cisplatin, colchicine, 
cyclophosphamide, cytarabine, Cytoxan, dacarbazine, dactinomycin, daunorubicin, 
docetaxel, estramustine phosphate, floxuridine, fludarabine, gentuzumab, 
hexamethylmelamine, hydroxyurea, ifosfamide, imatinib, interferon, irinotecan, 

25 lomustine, mechlorethamine, melphalen, 6-mercaptopurine, methotrexate, 
mitomycin, mitotane, mitoxantrone, pentostatin, procarbazine, alemtuzumab, 
rituximab, streptozocin, tamoxifen, temozolomide, teniposide, 6-thioguanine, 
topotecan, trastuzumab, vincristine, vindesine, rofecoxib, celecoxib, etodolac and 
vinorelbine. 
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In another aspect, the invention features a method for treating or inhibiting 
a cellular proliferative disorder in a patient, the method involves administering a 
pharmaceutical composition of the phosphopeptide of a previous aspect, where the 
phosphopeptide is in an amount sufficient to treat or inhibit the cellular 
5 proliferative disorder in the patient. In one embodiment, method includes 
administering a second chemotherapeutic agent, the phosphopeptide and the 
chemotherapeutic agent are in amounts sufficient to treat or inhibit the cellular 
proliferative disorder in the patient, and where the chemotherapeutic agent is 
administered simultaneously or within 1, 2, 3, 5, 7, 10, 14, or 28 days of 

10 administering the phosphopeptide. In another embodiment, the second 

chemotherapeutic agent is selected from the group consisting of paclitaxel, 
gemcitabine, doxorubicin, vinblastine, etoposide, 5-fluorouracil, carboplatin, 
altretamine, aminoglutethimide, amsacrine, anastrozole, azacitidine, bleomycin, 
busulfan, carmustine, chlorambucil, 2-chlorodeoxyadenosine, cisplatin, colchicine, 

15 cyclophosphamide, cytarabine, Cytoxan, dacarbazine, dactinomycin, daunorubicin, 
docetaxel, estramustine phosphate, floxuridine, fludarabine, gentuzumab, 
hexamethylmelamine, hydroxyurea, ifosfamide, imatinib, interferon, irinotecan, 
lomustine, mechlorethamine, melphalen, 6-mercaptopurine, methotrexate, 
mitomycin, mitotane, mitoxantrone, pentostatin, procarbazine, alemtuzumab, 

20 rituximab, streptozocin, tamoxifen, temozolomide, teniposide, 6-thioguanine, 

topotecan, trastuzumab, vincristine, vindesine, rofecoxib, celecoxib, etodolac and 
vinorelbine, or any other chemotherapeutic known in the art. In other 
embodiments, the cellular proliferative disorder is a neoplasm. 

In another aspect, the invention features a method for identifying a 

25 peptidomimetic compound that modulates Polo-like kinase biological activity, the 
method involves the steps of: a) contacting the phosphopeptide of a previous 
aspect and a Polo-box domain (PBD) polypeptide to form a complex between the 
phosphopeptide and the PBD; b) contacting the complex with a candidate 
compound; and c) measuring the displacement of the phosphopeptide from the 
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PBD, where the displacement of the phosphopeptide from the PBD indicates that 
the candidate compound is a peptidomimetic compound that modulates Polo-like 
kinase biological activity. 

In another aspect, the invention provides a method for identifying a 
5 peptidomimetic compound that modulates Polo-like kinase biological activity, the 
method involves the steps of: a) contacting the phosphopeptide of a previous 
aspect and a PBD in the presence of a candidate compound; and b) measuring 
binding of the phosphopeptide and the PBD, where a reduction in the amount of 
binding relative to the amount of binding of the phosphopeptide and the 

10 polypeptide in the absence of the candidate compound indicates that the candidate 
compound is a peptidomimetic compound that modulates Polo-like kinase 
biological activity. In one embodiment, the phosphopeptide or the PBD is 
detectably labeled. In another embodiment, the phosphopeptide and the PBD are 
differentially labeled. In another embodiment, the PBD is selected from a group 

15 consisting of the PBDs of Cdc5, Plo-1, Polo, Plx-1, Plx-2, Plx-3, Plk-1, Prk/Fnk, 
Snk, and Cnk. In another embodiment, the PBD is Plk-1 PBD. In another 
embodiment, the Plk-1 PBD is human Plk-1 PBD. 

In another aspect, the invention provides a method for identifying a binding 
pair consisting of a peptide and a peptide-binding domain, the method involes the 

20 steps of: a) providing a biased peptide library containing a collection of peptides 
fixed to a solid support, each peptide having at least two known amino acid 
residues whose position is invariant; b) providing a pooled cDNA library, where 
the cDNA library is positioned for protein expression; c) expressing the pooled 
cDNA library in the presence of a detectable label; d) contacting the peptide 

25 library and the expressed cDNA library; and e) detecting a peptide and peptide- 
binding domain interaction, where an interaction identifies a peptide and peptide- 
binding domain binding pair. In one embodiment, the biased peptide library is 
covalently bound to a solid support. In another embodiment, the biased peptide 
library is noncovalently bound to a solid support. In another embodiment, the 
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peptide is a phosphopeptide and the peptide binding domain is a phosphopeptide 
binding domain. 

In another aspect, the invention provides a method for identifying a binding 
pair containing a phosphopeptide and a phosphopeptide binding domain, the 
5 method involves the steps of: a) providing a biased phosphopeptide library, 

containing a collection of peptides fixed to a solid support, each peptide having at 
least two known amino acid residues whose position is invariant; where each 
phosphopeptide is covalently linked to a biotin group at the amino terminus; b) 
providing a pooled cDNA library, where the pooled cDNA library is positioned 

10 for protein expression; c) expressing the pooled cDNA library in the presence of a 
detectable label; d) contacting the phosphopeptide library and the expressed cDNA 
library; and e) detecting a phosphopeptide and the phosphopeptide binding domain 
interaction, where the presence of an interaction identifies a phosphopeptide and 
phosphopeptide binding domain. In one embodiment, method further comprises 

15 the steps of f) providing a non-phosphorylated peptide of step a), and g) detecting 
a peptide and phosphopeptide-binding domain interaction, where the absence of an 
interaction indicates the phosphopeptide and phosphopeptide binding domain 
interaction is authentic. 

In another aspect, the invention provides a method for identifying a binding 

20 pair consisting of a peptide and a peptide-binding domain; the method involves the 
steps of: a) providing a biased peptide library containing a collection of peptides 
fixed to a solid support, each peptide having at least two known amino acid 
residues whose position is invariant; b) contacting the biased peptide library with a 
detectably labeled peptide library; and c) detecting a biased peptide and detectably 

25 labeled peptide interaction, where an interaction identifies a peptide and peptide- 
binding domain binding pair. 

In another aspect, the invention features a method to identify 
phosphopeptide-binding modules, the method involves the steps of: (a) providing 
an immobilized phosphopeptide library and an immobilized peptide library; (b) 
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contacting the libraries with a polypeptide or polypeptide fragment; and (c) 
detecting preferential binding, where preferential binding to the phosphopeptide 
library in comparison to the peptide library identifies the polypeptide or 
polypeptide fragment as a phosphopeptide binding module. 
5 In another aspect, the invention provides a method to identify non- 

phosphopeptide-binding modules, the method involves the steps of: (a) providing 
an immobilized degenerate phosphopeptide library and an immobilized peptide 
library; (b) contacting the libraries with a polypeptide or polypeptide fragment; 
and (c) detecting preferential binding, where preferential binding to the peptide 

10 library in comparison to the phosphopeptide library identifies the polypeptide or 
polypeptide fragment as a non-phosphopeptide binding module. 

In another aspect, the invention provides a method to identify 
phosphopeptide-binding modules in the DNA damage response pathway, the 
method involves the steps of: (a) providing an immobilized pSer or pThr 

15 degenerate phosphopeptide library and an immobilized Ser or Thr peptide library; 
(b) contacting the libraries with a polypeptide or polypeptide fragment; and (c) 
detecting differential binding, where preferential binding to the phosphopeptide 
library in comparison to the peptide library identifies the polypeptide or 
polypeptide fragment as a phosphopeptide binding module. In one embodiment, 

20 the phosphopeptide or peptide libraries do not have the amino acids Arg, Lys, or 
His in a degenerate position in the libraries. In another embodiment, the 
polypeptides or polypeptide fragments are in vitro translated (IVT) polypeptides. 

In another aspect, the invention features a degenerate phosphopeptide 
containing a pSer or pThr that binds a BRCT domain. In one embodiment, the 

25 phosphopeptide further comprises an aromatic or aliphatic residue in the pSer or 
pThr +3 position; aromatic or aliphatic residues in the pSer or pThr +3 or +5 
positions; a Gin or an aromatic or an aliphatic residue in the +1 position; or the 
amino acid sequence Y-D-I-(pSer or pThr)-Q-V-F-P-F. 
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In another aspect, the invention features a phosphopeptide binding module 
containing a BRCT tandem domain. In one embodiment, the BRCT tandem 
domain comprises at least 100 amino acids of the 3rd and 4th BRCT domains of 
PTIP. In another embodiment, the BRCT pair comprises at least 100 amino acids 
5 of the BRCT domains of BRCA1 . In another embodiment, the tandem domain 
functions as a single module in phosphopeptide binding. 

In another aspect, the invention features an isolated fragment (e.g, 50, 100, 
150, 200, 250, or 300 amino acids) of tandem BRCT domains of PTIP or BRCA1 
in complex with a phosphopeptide containing a pSer or pThr amino acid. 

10 In another aspect, the invention features a complex containing a tanderti 

BRCT phosphopeptide binding module and a phosphopeptide containing a pSer or 
pThr. In one embodiment, the tandem BRCT phosphopeptide binding module is a 
fragment of PTIP in complex with a phosphopeptide. In another embodiment, the 
phosphopeptide further comprises an aromatic or aliphatic residue in the (pSer or 

15 pThr)+3 position; an aromatic or aliphatic residues in the (pSer or pThr)+3 or +5 
positions a Gin, or an aromatic or aliphatic residue in the +1 position; or the amino 
acid sequence Y-D-I-(pSer or pThr)-Q-V-F-P-F. In another aspect, the invention 
provides a method for identifying a candidate compound for the treatment or 
prevention of a neoplasia, the method containing detecting binding of the 

20 phosphopeptide binding module to a phosphopeptide in the presence of the 

candidate compound, where a candidate compound that modulates the binding is a 
compound useful for the treatment or prevention of a neoplasia. In one 
embodiment, binding is detected using an immunological assay, an enzymatic 
assay, or a radioimmunoassay. In another embodiment, the phosphopeptide 

25 binding module or fragment thereof is an isolated phosphopeptide binding module. 
In another embodiment, the phosphopeptide binding module or fragment thereof is 
an isolated phosphopeptide containing a pSer or pThr. In one embodiment, 
phosphopeptide is fixed to a solid support. In another embodiment, the 
phosphopeptide binding module is a tandem BRCT binding domain. In another 
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embodiment, the phosphopeptide binding module is fixed to a solid support. In 
another embodiment, the binding is assayed using an immunological assay, an 
enzymatic assay, or a radioimmunoassay. In another embodiment, the candidate 
compound is preincubated with the phosphopeptide binding module. In another 
5 embodiment, the candidate compound is preincubated with the phosphopeptide. In 
another embodiment, the phosphopeptide binding module and the phosphopeptide 
form a complex prior to being contacted with the candidate compound. In another 
embodiment, the candidate compound, the phosphopeptide and the 
phosphopeptide binding module are contacted concurrently. 

10 In another aspect, the invention features a method for identifying a candidate 
compound useful in treating or preventing a neoplasia in a subject, the method 
involves:(a) providing a cell expressing a phosphopeptide binding module or 
fragment thereof and a phosphopeptide containing a pSer or pThr; (b) contacting 
the cell with a candidate compound; and (c) comparing binding of the 

1 5 phosphopeptide binding module and the phosphopeptide in the cell contacted with 
the candidate compound to the binding in a control cell, where a modulation of the 
binding identifies the candidate compound as a compound useful to treat or 
prevent a neoplasia in a subject. In one embodiment, phosphopeptide binding 
moduleand the phosphopeptide are expressed in a prokaryotic or a eukaryotic cell 

20 in vitro. In another embodiment, the phosphopeptide binding module is expressed 
endogenously by the cell. In another embodiment, the phosphopeptide binding 
module is expressed as a recombinant protein. In another embodiment, the cell is 
a neoplastic cell. In another embodiment, the neoplastic cell is a mammalian cell. 
In another embodiment, the neoplastic cell is a human cell. In another 

25 embodiment, the candidate compound decreases the affinity of the binding. 

In another aspect, the invention features a pharmaceutical composition 
containing (i) a phosphopeptide containing a pSer or pThr and (ii) a 
pharmaceutically acceptable carrier, where the phosphopeptide is present in 
amounts that, when administered to a subject, ameliorates a neoplastic disease. In 
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one embodiment, the compositions comprises a second chemotherapeutic agent. 
In another embodiment, the second chemotherapeutic agent is selected from the 
group consisting of paclitaxel, gemcitabine, doxorubicin, vinblastine, etoposide, 5- 
fluorouracil, carboplatin, altretamine, aminoglutethimide, amsacrine, anastrozole, 
5 azacitidine, bleomycin, busulfan, carmustine, chlorambucil, 2- 

chlorodeoxyadenosine, cisplatin, colchicine, cyclophosphamide, cytarabine, 
Cytoxan, dacarbazine, dactinomycin, daunorubicin, docetaxel, estramustine 
phosphate, floxuridine, fludarabine, gentuzumab, hexamethylmelamine, 
hydroxyurea, ifosfamide, imatinib, interferon, irinotecan, lomustine, 
10 mechlorethamine, melphalen, 6-mercaptopurine, methotrexate, mitomycin, 
mitotane, mitoxantrone, pentostatin, procarbazine, alemtuzumab, rituximab, 
streptozocin, tamoxifen, temozolomide, teniposide, 6-thioguanine, topotecan, 
trastuzumab, vincristine, vindesine, rofecoxib, celecoxib, etodolac and 
vinorelbine. 

15 In another aspect, the invention provides a method for treating or inhibiting 

a cellular proliferative disorder in a patient, the method involves administering a 
pharmaceutical composition of the phosphopeptide of a previous aspect, where the 
phosphopeptide is in an amount sufficient to treat or inhibit the cellular 
proliferative disorder in the patient. In one embodiment, the method includes 

20 administering a second chemotherapeutic agent, the phosphopeptide and the 
chemotherapeutic agent are in amounts sufficient to treat or inhibit the cellular 
proliferative disorder in the patient, and where the chemotherapeutic agent is 
administered simultaneously or within fourteen days of administering the 
phosphopeptide. In another embodiment, the second chemotherapeutic agent is 

25 selected from the group consisting of paclitaxel, gemcitabine, doxorubicin, 

vinblastine, etoposide, 5-fluorouracil, carboplatin, altretamine, aminoglutethimide, 
amsacrine, anastrozole, azacitidine, bleomycin, busulfan, carmustine, 
chlorambucil, 2-chlorodeoxyadenosine, cisplatin, colchicine, cyclophosphamide, 
cytarabine, Cytoxan, dacarbazine, dactinomycin, daunorubicin, docetaxel, 
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estramustine phosphate, floxuridine, fludarabine, gentuzumab, 
hexamethylmelamine, hydroxyurea, ifosfamide, imatinib, interferon, irinotecan, 
lomustine, mechlorethamine, melphalen, 6-mercaptopurine, methotrexate, 
mitomycin, mitotane, mitoxantrone, pentostatin, procarbazine, alemtuzumab, 
5 rituximab, streptozocin, tamoxifen, temozolomide, teniposide, 6-thioguanine, 
topotecan, trastuzumab, vincristine, vindesine, rofecoxib, celecoxib, etodolac and 
vinorelbine. In another embodiment, the cellular proliferative disorder is a 
neoplasm. 

In another aspect, the invention features a method for identifying a 

10 peptidomimetic compound that modulates BRCT biological activity, the method 
involves the steps of: a) contacting the phosphopeptide of claim a previous aspect 
and a BRCT binding domain domain polypeptide to form a complex between the 
phosphopeptide and the BRCT; b) contacting the complex with a candidate 
compound; and c) measuring the displacement of the phosphopeptide from the 

15 BRCT binding domain, where the displacement of the phosphopeptide from the 
BRCT binding domain indicates that the candidate compound is a peptidomimetic 
compound that modulates BRCT binding domain biological activity. 

In another aspect, the invention features a method for identifying a 
peptidomimetic compound that modulates BRCT binding domain biological 

20 activity, the method involves the steps of: a) contacting the phosphopeptide of a 
previous aspect and a BRCT binding domain in the presence of a candidate 
compound; and b) measuring binding of the phosphopeptide and the BRCT 
binding domain, where a reduction in the amount of binding relative to the amount 
of binding of the phosphopeptide and the polypeptide in the absence of the 

25 candidate compound indicates that the candidate compound is a peptidomimetic 
compound that modulates BRCT binding domain biological activity. In one 
embodiment, the phosphopeptide or the BRCT binding domain is detectably 
labeled. In another embodiment, the phosphopeptide and the BRCT binding 
domain are differentially labeled. In other embodiments, the BRCT binding 
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domain is BRCA1 or PTIP. In another embodiment, the BRCT binding domain is 
of human BRCA1 . In one embodiment, BRCT binding domain is of human PTIP. 

In another aspect, the invention features a kit containing (i) a small 
molecule that binds a BRCT binding domain and (ii) instructions for administering 
5 the small molecule to a patient diagnosed with or having a propensity to develop a 
neoplasia. In one embodiment, the kit further comprises a second 
chemotherapeutic compound. 

In another aspect, the invention features a method of assessing a patient as 
having, or having a propensity to develop, a neoplasia, the method involves 
determining the level of expression of an a BRCT binding domain nucleic acid 
molecule or polypeptide in a patient sample, where an increased level of 
expression relative to the level of expression in a control sample, indicates that the 
patient has or has a propensity to develop a neoplasia. In one embodiment, the 
patient sample is a blood or tissue sample. In another embodiment, the method 
comprises determining the level of expression of the BRCT binding domain 
nucleic acid molecule. In another embodiment, the method comprises determining 
the level of expression of the a BRCT binding domain polypeptide. In another 
embodiment, the level of expression is determined in an immunological assay. In 
another embodiment, the method is used to diagnose a patient as having neoplasia. 

In another aspect, the invention features a method to identify a peptide- 
binding module, the method involves the steps of: (a) providing an immobilized 
10 modified peptide library and an immobilized peptide library; (b) contacting the 
libraries with a polypeptide or polypeptide fragment; and (c) detecting preferential 
binding, where preferential binding to the modified peptide library in comparison 
to the peptide library identifies the polypeptide or polypeptide fragment as a 
modified peptide binding module. 
15 In another aspect, the invention features a method for identifying a binding 

pair consisting of a modified peptide and a peptide-binding domain, the method 
involves the steps of: a) providing a biased peptide library containing a collection 
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of modified peptides fixed to a solid support, each peptide having one amino acid 
residues whose position is invariant; b) providing a pooled cDNA library, where 
the cDNA library is positioned for protein expression; c) expressing the pooled 
cDNA library in the presence of a detectable label; d) contacting the peptide 
5 library and the expressed cDNA library; and e) detecting a modified peptide and 
peptide-binding domain interaction, where an interaction identifies a modified 
peptide and peptide-binding domain binding pair. In one embodiment, the amino 
acid contains a modification that is natural or unnatural. In another embodiment, 
the modification is selected from the group consisting of methylation, acetylation, 

10 ubiquitination, glycosylation, sumolation, or arsenylation, or any other 
modification known to the skilled artisan. 

In various embodiments of any of the above aspects, the peptide includes 
unnatural amino acids as described herein. 

By "analog" is meant a molecule that is not identical but has analogous 

15 features. For example, a peptide analog retains the biological activity of a 
corresponding naturally-occurring peptide, while having certain biochemical 
modifications that enhance the analogs function relative to a naturally occurring 
peptide. Such biochemical modifications might increase the analogs protease 
resistance, membrane permeability, or half-life, without altering, for example, 

20 ligand binding. An analog can include a non-natural amino acid. 

In another example, a nucleic acid analog retains the ability to hybridize to 
a naturally-occurring corresponding nucleic acid sequence, while having certain 
biochemical modifications that enhance the analogs function relative to a 
naturally-occurring nucleic acid. In some nucleic acid analogs the sugar and/or 

25 the internucleoside linkage, i.e., the backbone, of the nucleotide units are replaced 
with novel groups. The base units are maintained for hybridization with an 
appropriate nucleic acid target compound. Peptide and nucleic acid modifications 
may be achieved by any of the techniques known in the art for derivatization of 
peptides or nucleic acids into fragments, analogs, or derivatives thereof. Such 
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terms and in particular, "analog", also specifically include peptide, non-peptide, 
peptide/nucleic acid hybrid molecules, small molecules and other compounds that 
function as Polo-like kinase nucleic acid or peptide mimics. 

By "apoptosis" is meant the process of cell death where a dying cell 
5 displays at least one of a set of well-characterized biological hallmarks, including 
cell membrane blebbing, cell soma shrinkage, chromatin condensation, or DNA 
laddering. 

By "biased phosphopeptide library' 5 is meant a phosphoserine, 
phosphothreonine, and/or phosphotyrosine degenerate peptide library, wherein 

10 specific amino acid residues of the phosphopeptide are fixed so as to be expressed 
in all phosphopeptides in the specific library. For instance, a biased 
phosphopeptide library can be synthesized to contain the core sequence Ser-pSer- 
Pro or Ser-pThr-Pro. In a desirable embodiment, the amino acid residue adjacent 
to the phosphoserine, phosphothreonine, or phosphotyrosine residue is fixed. 

15 By an "amino acid fragment" is meant an amino acid residue that has been 

incorporated into a peptide chain via its alpha carboxyl, its alpha nitrogen, or both. 
A terminal amino acid is any natural or unnatural amino acid residue at the amino- 
terminus or the carboxy-terminus. An internal amino acid is any natural or 
unnatural amino acid residue that is not a terminal amino acid. 

20 As used herein, the terms "alkyl" and the prefix "alk-" are inclusive of both 

straight chain and branched chain groups and of cyclic groups, i.e., cycloalkyl and 
cycloalkenyl groups. Cyclic groups can be monocyclic or polycyclic and 
preferably have from 3 to 8 ring carbon atoms, inclusive. Exemplary cyclic 
groups include cyclopropyl, cyclopentyl, cyclohexyl, and adamantyl groups. 

25 By "aromatic residue" is meant an aromatic group having a ring system 

with conjugated n electrons (e.g., phenyl or imidazole). The ring of the aryl group 
is preferably 5 to 6 atoms. The aromatic ring may be exclusively composed of 
carbon atoms or may be composed of a mixture of carbon atoms and heteroatoms. 
Preferred heteroatoms include nitrogen, oxygen, sulfur, and phosphorous. Aryl 
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groups may optionally include monocyclic, bicyclic, or tricyclic rings, where each 
ring has preferably five or six members. The aryl group may be substituted or 
unsubstituted. Exemplary substituents include alkyl, hydroxyl, alkoxy, aryloxy, 
sulfhydryl, alkylthio, arylthio, halo, fluoroalkyl, carboxyl, carboxyalkyl, amino, 
5 aminoalkyl, monosubstituted amino, disubstituted amino, and quaternary amino 
groups. 

By "aryl" is meant a carbocyclic aromatic ring or ring system. Unless 
otherwise specified, aryl groups are from 6 to 18 carbons. Examples of aryl 
groups include phenyl, naphthyl, biphenyl, fluorenyl, and indenyl groups. 

10 By "heteroaryl" is meant an aromatic ring or ring system that contains at 

least one ring hetero-atom (e.g., O, S, N). Unless otherwise specified, heteroaryl 
groups are from 1 to 9 carbons.. Heteroaryl groups include furanyl, thienyl, 
pyrrolyl, imidazolyl, pyrazolyl, oxazolyl, isoxazolyl, thiazolyl, isothiazolyl, 
triazolyl, oxadiazolyl, oxatriazolyl, pyridyl, pyridazyl, pyrimidyl, pyrazyl, triazyl, 

15 benzofuranyl, isobenzofuranyl, benzothienyl, indole, indazolyl, indolizinyl, 

benzisoxazolyl, quinolinyl, isoquinolinyl, cinnolinyl, quinazolinyl, naphtyridinyl, 
phthalazinyl, phenanthrolinyl, purinyl, and carbazolyl groups. 

By "heterocycle" is meant a non-aromatic ring or ring system that contains 
at least one ring heteroatom (e.g., O, S, N). Unless otherwise specified, 

20 heterocyclic groups are from 1 to 9 carbons. Heterocyclic groups include, for 
example, dihydropyrrolyl, tetrahydropyrrolyl, piperazinyl, pyranyl, 
dihydropyranyl, tetrahydropyranyl, tetrahydrofuranyl, dihydrothiophene, 
tetrahydrothiophene, and morpholinyl groups. 

By "halide" or "halogen" or "halo" is meant bromine, chlorine, iodine, or 

25 fluorine. 

The aryl, heteroaryl, and heterocyclyl groups may be unsubstituted or 
substituted by one or more substituents selected from the group consisting of C1.5 
alkyl, hydroxy, halo, nitro, C U5 alkoxy, C1.5 alkylthio, trihalomethyl, C|. 5 acyl, 
arylcarbonyl, heteroarylcarbonyl, nitrile, C|. 5 alkoxycarbonyl, oxo, arylalkyl 
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(wherein the alkyl group has from 1 to 5 carbon atoms) and heteroarylalkyl 
(wherein the alkyl group has from 1 to 5 carbon atoms). 

By "biased phosphopeptide library" is meant a phosphoserine, 
phosphothreonine, and/or phosphotyrosine degenerate peptide library, wherein 
5 specific amino acid residues of the phosphopeptide are fixed so as to be expressed 
in all phosphopeptides in the specific library. For instance, a biased 
phosphopeptide library can be synthesized to contain the core sequence Ser-pSer- 
Pro or Ser-pThr-Pro. In a desirable embodiment, the amino acid residue adjacent 
to the phosphoserine, phosphothreonine, or phosphotyrosine residue is fixed. 

10 By an "amino acid fragment" is meant an amino acid residue that has been 

incorporated into a peptide chain via its alpha carboxyl, its alpha nitrogen, or both. 
A terminal amino acid is any natural or unnatural amino acid residue at the amino- 
terminus or the carboxy-terminus. An internal amino acid is any natural or 
unnatural amino acid residue that is not a terminal amino acid. 

15 As used herein, the terms "alkyl" and the prefix "alk-" are inclusive of both 

straight chain and branched chain groups and of cyclic groups, i.e., cycloalkyl and 
cycloalkenyl groups. Cyclic groups can be monocyclic or polycyclic and 
preferably have from 3 to 8 ring carbon atoms, inclusive. Exemplary cyclic 
groups include cyclopropyl, cyclopentyl, cyclohexyl, and adamantyl groups. 

20 By "aromatic residue" is meant an aromatic group having a ring system 

with conjugated n electrons (e.g., phenyl or imidazole). The ring of the aryl group 
is preferably 5 to 6 atoms. The aromatic ring may be exclusively composed of 
carbon atoms or may be composed of a mixture of carbon atoms and heteroatoms. 
Preferred heteroatoms include nitrogen, oxygen, sulfur, and phosphorous. Aryl 

25 groups may optionally include monocyclic, bicyclic, or tricyclic rings, where each 
ring has preferably five or six members. The aryl group may be substituted or 
unsubstituted. Exemplary substituents include alkyl, hydroxyl, alkoxy, aryloxy, 
sulfhydryl, alkylthio, arylthio, halo, fluoroalkyl, carboxyl, carboxyalkyl, amino, 
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aminoalkyl, monosubstituted amino, di substituted amino, and quaternary amino 
groups. 

By "aryl" is meant a carbocyclic aromatic ring or ring system. Unless 
otherwise specified, aryl groups are from 6 to 18 carbons. Examples of aryl 
5 groups include phenyl, naphthyl, biphenyl, fluorenyl, and indenyl groups. 

By "BRCA1 nucleic acid" is meant a nucleic acid, or analog thereof, that 
encodes BRCA1 or is substantially identical to Gene Bank Accession No: 
30039658. 

By "BRCA1 polypeptide" is meant a polypeptide, or analog thereof, 
10 substantially identical to BRCA1 Genbank Accession NO. 30039659 and having 
BRCA1 biological activity. 

By "BRCA1 biological activity" is meant function in a DNA damage 
response pathway or phosphopeptide binding. 

By "BRCT nucleic acid is meant a nucleic acid, or nucleic acid analog, that 
15 encodes tandem BRCT domains. For example, a nucleic acid substantially 
identical to PTIP BC03378U 2 17074571, or NM_007349 (PAX transcription 
activation domain interacting protein 1 mRNA) or Gene Bank Accession No: 
AY27380U 300396581. 

By "tandem BRCT polypeptide is meant a protein having at least 2 tandem 
20 BRCT domains. For example, a protein substantially identical to AAH33781 , 
NP_031375, or Genbank Accession NO- 30039659. 

By "candidate compound" is meant any nucleic acid molecule, polypeptide, 
or other small molecule, that is assayed for its ability to alter gene or protein 
expression levels, or the biological activity of a gene or protein by employing one 
25 of the assay methods described herein. Candidate compounds include, for 
example, peptides, polypeptides, synthesized organic molecules, naturally 
occurring organic molecules, nucleic acid molecules, and components thereof. 

By "detectably-labeled" is meant any means for marking and identifying 
the presence of a molecule, e.g., a PBD-interacting phosphopeptide, a PBD, a 
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nucleic acid encoding the same, or a peptidomimetic small molecule. Methods for 
detectably-labeling a molecule are well known in the art and include, without 
limitation, radionuclides (e.g., with an isotope such as 32 P, 33 P, 125 I, or 35 S) and 
nonradioactive labeling (e.g., chemiluminescent labeling or fluorescein labeling). 
5 If required, molecules can be differentially labeled using markers that can 

distinguish the presence of multiply distinct molecules. For example, a PBD 
domain-interacting phosphopeptide can be labeled with fluorescein and a PBD 
domain polypeptide can be labeled with Texas Red. The presence of the 
phosphopeptide can be monitored simultaneously with the presence of the PBD. 

10 By "diseases or disorder characterized by inappropriate cell cycle control" 

is meant any pathological condition in which there is an abnormal increase or 
decrease in cell proliferation. Exemplary diseases or disorder characterized by 
inappropriate cell cycle control include cancer or neoplasms, inflammatory 
diseases, or hyperplasias (e.g. some forms of hypertension, prostatic hyperplasia). 

1 5 By "disease or disorder characterized by inappropriate cell death" is meant 

any pathological condition in which there is an abnormal increase in apoptosis. 
Exemplary diseases or disorders characterized by inappropriate cell death include 
neurodegenerative diseases (e.g., Alzheimer's, Huntington's, and Parkinson's 
disease), cardiac disorders (e.g., congestive heart failure and myocardial 

20 infarction), diabetic retinopathy, and age-related macular degeneration. 

By "fragment" is meant a portion of a protein (50, 100, 150, 175, 200, 300, 
or 400 amino acids) or nucleic acid (50, 100, 150, 175, 200, 300, or 400 nucleic 
acids) that is substantially identical to a reference protein or nucleic acid, and 
retains at least 50% or 75%, more preferably 80%, 90%, or 95%, or even 99% of 

25 the biological activity of the reference protein or nucleic acid using a molting 
assay as described herein. 

By "heteroaryl" is meant an aromatic ring or ring system that contains at 
least one ring hetero-atom (e.g., O, S, N). Unless otherwise specified, heteroaryl 
groups are from 1 to 9 carbons.. Heteroaryl groups include furanyl, thienyl, 
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pyrrolyl, imidazolyl, pyrazolyl, oxazolyl, isoxazolyl, thiazolyl, isothiazolyl, 
triazolyl, oxadiazolyl, oxatriazolyl, pyridyl, pyridazyl, pyrimidyl, pyrazyl, triazyl, 
benzofuranyl, isobenzofuranyl, benzothienyl, indole, indazolyl, indolizinyl, 
benzisoxazolyl, quinolinyl, isoquinolinyl, cinnolinyl, quinazolinyl, naphtyridinyl, 
5 phthalazinyl, phenanthrolinyl, purinyl, and carbazolyl groups. 

By "heterocycle" is meant a non-aromatic ring or ring system that contains 
at least one ring heteroatom (e.g., O, S, N). Unless otherwise specified, 
heterocyclic groups are from 1 to 9 carbons. Heterocyclic groups include, for 
example, dihydropyrrolyl, tetrahydropyrrolyl, piperazinyl, pyranyl, 
10 dihydropyranyl, tetrahydropyranyl, tetrahydrofuranyl, dihydrothiophene, 
tetrahydrothiophene, and morpholinyl groups. 

By "halide" or "halogen" or "halo" is meant bromine, chlorine, iodine, or 
fluorine. 

The aryl, heteroaryl, and heterocyclyl groups may be unsubstituted or 
15 substituted by one or more substituents selected from the group consisting of C]_5 
alkyl, hydroxy, halo, nitro, Ci_ 5 alkoxy, C\. 5 alkylthio, trihalomethyl, C^acyl, 
arylcarbonyl, heteroarylcarbonyl, nitrile, Ci^ alkoxycarbonyl, oxo, arylalkyl 
(wherein the alkyl group has from 1 to 5 carbon atoms) and heteroarylalkyl 
(wherein the alkyl group has from 1 to 5 carbon atoms). 
20 By "isolated polynucleotide" is meant a nucleic acid (e.g., a DNA) that is 

free of the genes which, in the naturally-occurring genome of the organism from 
which the nucleic acid molecule of the invention is derived, flank the gene. The 
term therefore includes, for example, a recombinant DNA that is incorporated into 
a vector; into an autonomously replicating plasmid or virus; or into the genomic 
25 DNA of a prokaryote or eukaryote; or that exists as a separate molecule (for 

example, a cDNA or a genomic or cDNA fragment produced by PCR or restriction 
endonuclease digestion) independent of other sequences. In addition, the term 
includes an RNA molecule which is transcribed from a DNA molecule, as well as 
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a recombinant DNA which is part of a hybrid gene encoding additional 
polypeptide sequence. 

By "isolated polypeptide" is meant a polypeptide of the invention that has 
been separated from components which naturally accompany it. Typically, the 
5 polypeptide is isolated when it is at least 60%, by weight, free from the proteins 
and naturally-occurring organic molecules with which it is naturally associated. 
Preferably, the preparation is at least 75%, more preferably at least 90%, and most 
preferably at least 99%, by weight, a polypeptide of the invention. An isolated 
polypeptide of the invention may be obtained, for example, by extraction from a 

10 . natural source, by expression of a recombinant nucleic acid encoding such a 

polypeptide; or by chemically synthesizing the protein. Purity can be measured by 
any appropriate method, for example, column chromatography, polyacrylamide 
gel electrophoresis, or by HPLC analysis. 

By "modulate" is meant a change, such as a decrease or increase. 

15 Desirably, the change is either an increase or a decrease of at least 10%, 20%, 
30%, 40%, 50%, 60%, 70%, 80%, 90% or 95% in expression or biological 
activity, relative to a reference or to control expression or activity, for example the 
expression or biological activity of a naturally occurring Polo-like kinase. 
By "neoplasia" is meant a disease characterized by the pathological 

20 proliferation of a cell or tissue and its subsequent migration to or invasion of other 
tissues or organs. Neoplasia growth is typically uncontrolled and progressive, and 
occurs under conditions that would not elicit, or would cause cessation of, 
multiplication of normal cells. Neoplasias can affect a variety of cell types, 
tissues, or organs, including but not limited to an organ selected from the group 

25 consisting of bladder, bone, brain, breast, cartilage, glia, esophagus, fallopian tube, 
gallbladder, heart, intestines, kidney, liver, lung, lymph node, nervous tissue, 
ovaries, pancreas, prostate, skeletal muscle, skin, spinal cord, spleen, stomach, 
testes, thymus, thyroid, trachea, urogenital tract, ureter, urethra, uterus, and 
vagina, or a tissue or cell type thereof. Neoplasias include cancers, such as 



-28- 



sarcomas, carcinomas, or plasmacytomas (e.g., acute lymphocytic leukemia, acute 
myelocytic leukemia, acute myeloblasts leukemia, acute promyelocyte leukemia, 
acute myelomonocytic leukemia, acute monocytic leukemia, acute 
erythroleukemia, chronic leukemia, chronic myelocytic leukemia, chronic 
5 lymphocytic leukemia, polycythemia vera, lymphoma Hodgkin's disease, 

Waldenstrom's macroglobulinemia, fibrosarcoma, myxosarcoma, liposarcoma, 
chondrosarcoma, osteogenic sarcoma, chordoma, angiosarcoma, 
endotheliosarcoma, lymphangiosarcoma, lymphangioendotheliosarcoma, 
synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, 

10 colon carcinoma, pancreatic cancer, breast cancer, ovarian cancer, prostate cancer, 
squamous cell carcinoma, basal cell carcinoma, sweat gland carcinoma, sebaceous 
gland carcinoma, papillary carcinoma, papillary adenocarcinomas, 
cystadenocarcinoma, medullary carcinoma, bronchogenic carcinoma, renal cell 
carcinoma, hepatoma, nile duct carcinoma, choriocarcinoma, seminoma, 

15 embryonal carcinoma, Wilm's tumor, cervical cancer, uterine cancer, testicular 
cancer, lung carcinoma, small cell lung carcinoma, bladder carcinoma, epithelial 
carcinoma, glioma, astrocytoma, medulloblastoma, craniopharyngioma, 
ependymoma, pinealoma, hemangioblastoma, acoustic neuroma, 
oligodenriglioma, schwannoma, meningioma, melanoma, neuroblastoma, or 

20 retinoplastoma). 

By "nucleic acid" is meant an oligomer or polymer of ribonucleic acid or 
deoxyribonucleic acid, or analog thereof. This term includes oligomers consisting 
of naturally occurring bases, sugars, and intersugar (backbone) linkages as well as 
oligomers having non-naturally occurring portions which function similarly. Such 

25 modified or substituted oligonucleotides are often preferred over native forms 

because of properties such as, for example, enhanced cellular uptake and increased 
stability in the presence of nucleases. 

Specific examples of some preferred nucleic acids envisioned for this 
invention may contain phosphorothioates, phosphotriesters, methyl phosphonates, 
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short chain alkyl or cycloalkyl intersugar linkages or short chain heteroatomic or 
heterocyclic intersugar linkages. Most preferred are those with CH 2 -NH — O — 
CH 2 , CH 2 — N(CH 3 )— O— CH 2 , CH 2 — O— N(CH 3 )— CH 2 , CH 2 — N(CH 3 )— 
N(CH 3 )— CH 2 and O— N(CH 3 )— CH 2 — CH 2 backbones (where phosphodiester is 
5 O — P — O — CH 2 ). Also preferred are oligonucleotides having morpholino 

backbone structures (Summerton, J.E. and Weller, D.D., U.S. Pat. No: 5,034,506). 
In other preferred embodiments, such as the protein-nucleic acid (PNA) backbone, 
the phosphodiester backbone of the oligonucleotide may be replaced with a 
polyamide backbone, the bases being bound directly or indirectly to the aza 

10 nitrogen atoms of the polyamide backbone (P.E. Nielsen et al. Science 199: 254, 
1997). Other preferred oligonucleotides may contain alkyl and halogen- 
substituted sugar moieties comprising one of the following at the 2' position: OH, 
SH, SCH 3 , F, OCN, 0(CH 2 ) n NH 2 or 0(CH 2 ) n CH 3 , where n is from 1 to about 10; 
Ci to Cio lower alkyl, substituted lower alkyl, alkaryl or aralkyl; CI; Br; CN; CF 3 ; 

15 OCF 3 ; 0-, S-, or N-alkyl; 0-, S-, or N-alkenyl; SOCH 3 ; S0 2 CH 3 ; ON0 2 ; N0 2 ; N 3 ; 
NH 2 ; heterocycloalkyl; heterocycloalkaryl; aminoalkylamino; polyalkylamino; 
substituted silyl; an RNA cleaving group; a conjugate; a reporter group; an 
intercalator; a group for improving the pharmacokinetic properties of an 
oligonucleotide; or a group for improving the pharmacodynamic properties of an 

20 oligonucleotide and other substituents having similar properties. Oligonucleotides 
may also have sugar mimetics such as cyclobutyls in place of the pentofuranosyl 
group. 

Other preferred embodiments may include at least one modified base form. 
Some specific examples of such modified bases include 2-(amino)adenine, 2- 
25 (methylamino)adenine, 2-(imidazolylalkyl)adenine, 2-(aminoalklyamino)adenine, 
or other heterosubstituted alkyladenines. 

By "Pax2 /ra^s-activation domain-interacting protein (PTIP) nucleic acid" 
is meant a nucleic acid, or analog thereof, substantially identical to Genebank 
Accession No:21707457 or NM_007349. 
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By "Pax2 /ra/7s-activation domain-interacting protein (PTIP)" is meant a 
polypeptide, or analog thereof, substantially identical to Genebank Accession No: 
AAH33781.1or NP_031375, and having PTIP biological activity. 

By "PTIP biological activity" is meant function in a DNA damage response 
5 pathway or phosphopeptide binding. 

By "pharmaceutically acceptable excipient" is meant a carrier that is 
physiologically acceptable to the subject to which it is administered and that 
preserves the therapeutic properties of the compound with which it is 
administered. One exemplary pharmaceutically acceptable excipient is 
10 physiological saline. Other physiologically acceptable excipients and their 
formulations are known to one skilled in the art and described, for example, in 
" Remington: The Science and Practice of Pharmacy " (20th ed., ed. A.R. Gennaro 
AR., 2000, Lippincott Williams & Wilkins). 

By a "peptidomimetic" is meant a compound that is capable of mimicking 
15 or antagonizing the biological actions of a natural parent peptide. A 

peptidomimetic may include non-peptidic structural elements, unnatural peptides, 
synthesized organic molecules, naturally occurring organic molecules, nucleic acid 
molecules, and components thereof. Identification of a peptidomimetic can be 
accomplished by screening methods incorporating a binding pair and identifying 
20 compounds that displace the binding pair. Alternatively, a peptidomimetic can be 
designed in silico, by molecular modeling of a known protein-protein interaction, 
for example, the interaction of a phosphopeptide of the invention and a PBD. 
Desirably, the peptidomimetic will displace one member of a binding pair by 
occupying the same binding interface. More desirably the peptidomimetic will 
25 have a higher binding affinity to the binding interface. 

By "Polo-like kinase (PLK) nucleic acid molecule" is meant a nucleic acid, 
or nucleic acid analog, that encodes a Polo-like kinase polypeptide. For example, 
a Plk-1 nucleic acid molecule is substantially identical to GenBank Accession 
Number X73458 or NMJ)05030; a Plk-2/SNK nucleic acid molecule is 
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substantially identical to NM_006622; a Plk-3 nucleic acid molecule is 
substantially identical to NM_004073; a Plx-1 nucleotide sequence is substantially 
identical to GenBank Accession Number U58205; and a Polo nucleic acid 
molecule is substantially identical to GenBank Accession Number AY095028 or 
5 NM_079455. 

By a "Polo-like kinase" is meant a polypeptide substantially identical to a 
Polo-like kinase amino acid sequence, having serine/threonine kinase activity, and 
having at least one Polo-box domain consisting of 2 Polo-boxes. Exemplary Polo- 
like kinase polypeptides include, Plk-1 (GenBank Accession Number NP_005021, 
10 SEQIDNO:l); Plk-2 (GenBank Accession Number NP_0066 13, SEQIDNO:4); 
and Plk-3 (GenBank Accession Number NP_004064, SEQ ID NO: 5). Additional 
Polo-like kinase polypeptides include GenBank Accession Numbers P53350, and 
Q07832. 

Structurally, Polo or Polo-like kinases have a unique amino terminus 
15 followed by a serine/threonine kinase domain, a linker region, a Polo-box (PB1), a 
linker sequence, a second Polo-box (PB 2), and a small stretch of 12-20 amino 
acids at the carboxy terminus (see Figure 2A). 

In desirable embodiments, Polo-like kinases include Saccaromyces 
cereviseae, Cdc5, Schizosaccaromyces pombe, Plo-1, Drosophila melanogaster, 
20 Polo, Xenopus laevis, Plx (Plx-1, -2, -3), and mammalian Plk-1, Prk/Fnk, Snk, and 
Cnk. The Polo-box is approximately 70 amino acids in length and is shown in 
Figure 2B (indicated by the bold lines). 

By "Polo-like kinase biological activity" is meant any biological activity 
associated with Polo-like kinases, such as serine/threonine kinase activity. Other 
25 biological activities of Polo-like kinases include the localization of the kinase to 
the centrosomes, spindle apparatus, and microtubular organizing centers (MOCs). 

By "polypeptide" is meant any chain of at least two naturally-occurring 
amino acids, or unnatural amino acids (e.g., those amino acids that do not occur in 
nature) regardless of post-translational modification (e.g., glycosylation or 
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phosphorylation), constituting all or part of a naturally-occurring or unnatural 
polypeptide or peptide, as is described herein. Naturally occurring amino acids are 
any one of the following, alanine (A or Ala), cysteine (C or Cys), aspartic acid (D 
or Asp), glutamic acid (E or Glu), phenylalanine (F or Phe), glycine (G or Gly), 
5 histidine (H, or His), isoleucine (I or He), lysine (K or Lys), leucine (L or Leu), 
methionine (M or Met), asparagine (N or Asn), ornithine (O or Orn), proline (P or 
Pro), hydroxyproline (Hyp), glutamine (Q or Gin), arginine (R or Arg), serine (S 
or Ser), threonine (T or Thr), valine (V or Val), tryptophan (W or Trp), or tyrosine 
(YorTyr). 

10 By "peptide" is meant any compound composed of amino acids, amino acid 

analogs, chemically bound together. In general, the amino acids are chemically 
bound together via amide linkages (CONH); however, the amino acids may be 
bound together by other chemical bonds known in the art. For example, the amino 
acids may be bound by amine linkages. Peptide as used herein includes oligomers 

15 of amino acids, amino acid analog, or small and large peptides, including 
polypeptides. 

Polypeptides or derivatives thereof may be fused or attached to another 
protein or peptide, for example, as a Glutathione-S-Transferase (GST) fusion 
polypeptide. Other commonly employed fusion polypeptides include, but are not 

20 limited to, maltose-binding protein, Staphylococcus aureus protein A, Flag-Tag, 
HA-tag, green fluorescent proteins (e.g., eGFP, eYFP, eCFP, GFP, YFP, CFP), 
red fluorescent protein, polyhistidine (6xHis), and cellulose-binding protein. 

By "phosphopeptide" or "phosphoprotein" means a peptide or protein in 
which one or more phosphate moieties are covalently linked to serine, threonine, 

25 tyrosine, aspartic acid, histidine amino acid residues, or amino acid analogs. A 
peptide can be phosphorylated to the extent of the number of serine, threonine, 
tyrosine, or histidine amino acid residues that is present. Desirably, a 
phosphopeptide is phosphorylated at 4 independent Ser/Thr/Tyr residues, at 3 
independent Ser/Thr/Tyr residues, or at 2 independent Ser/Thr/Tyr residues. Most 
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desirably, a phosphopeptide is phosphorylated at one Ser/Thr/Tyr residue 
regardless of the presence of multiple Ser, Thr, or Tyr residues. 

Typically, a phosphopeptide is produced by expression in a prokaryotic or 
eukaryotic cell under appropriate conditions or in translation extracts where the 
5 peptide is subsequently isolated, and phosphorylated using an appropriate kinase. 
Alternatively, a phosphopeptide may be synthesized by standard chemical 
methods, for example, using N-a-FMOC-protected amino acids (including 
appropriate phosphoamino acids). In a desired embodiment, the use of non- 
hydrolysable phosphate analogs can be incorporated to produce non-hydrolysable 

10 phosphopeptides (Jenkins et al., J. Am. Chem. Soc, 124:6584-6593, 2002; herein 
incorporated by reference). Such methods of protein synthesis are commonly used 
and practiced by standard methods in molecular biology and protein biochemistry 
(Ausubel et aL, Current Protocols in Molecular Biology , John Wiley & Sons, New 
York, NY, 1994, J. Sambrook and D. Russel, Molecular Cloning: A Laboratory 

1 5 Manual , 3 rd Edition, Cold Spring Harbor Laboratory Press, Woodbury NY, 2000). 
Desirably, a phosphopeptide employed in the invention is generally not longer 
than 100 amino acid residues in length, desirably less than 50 residues, more 
desirably less than 25 residues, 20 residues, 15 residues. Most desirably the 
phosphopeptide is 2, 3, 4, 5, 6, 7, 8, 9, or 10 amino acid residues long. 

20 By "substantially identical" is meant a polypeptide or nucleic acid 

exhibiting at least 75%, but preferably 85%, more preferably 90%, most preferably 
95%, or even 99% identity to a reference amino acid or nucleic acid sequence. 
For polypeptides, the length of comparison sequences will generally be at least 35 
amino acids, preferably at least 45 amino acids, more preferably at least 55 amino 

25 acids, and most preferably 70 amino acids. For nucleic acids, the length of 

comparison sequences will generally be at least 60 nucleotides, preferably at least 
90 nucleotides, and more preferably at least 120 nucleotides. 

Sequence identity is typically measured using sequence analysis software 
with the default parameters specified therein (e.g., Sequence Analysis Software 
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Package of the Genetics Computer Group, University of Wisconsin Biotechnology 
Center, 1710 University Avenue, Madison, WI 53705). This software program 
matches similar sequences by assigning degrees of homology to various 
substitutions, deletions, and other modifications. Conservative substitutions 
5 typically include substitutions within the following groups: glycine, alanine, 
valine, isoleucine, leucine, methionine; aspartic acid, glutamic acid, asparagine, 
glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. 

By "unnatural amino acid 1 ' is meant an organic compound that has a 
structure similar to a natural amino acid, where it mimics the structure and 
10 reactivity of a natural amino acid. The unnatural amino acid as defined herein 
generally increases or enhances the properties of a peptide (e.g., selectivity, 
stability, binding affinity) when the unnatural amino acid is either substituted for a 
natural amino acid or incorporated into a peptide. 

Unnatural amino acids and peptides including such amino acids are 
15 described in U.S. Patent No. 6,566,330 and 6,555,522. 

Other features and advantages of the invention will be apparent from the . 
following description of the desirable embodiments thereof, and from the claims. 

Brief Description of the Drawings 
20 The application file contains drawings executed in color (Figures 10, 11, 

12, 14, and 21). Copies of this patent or patent application with color drawings 
will be provided by the Office upon request and payment of the necessary fee. 

Figures 1A and IB depict a novel phospho-motif-based library vs. library 
screen to identify phosphoserine/threonine binding domains. Figure 1 A depicts a 
25 library of phosphothreonine-proline oriented phosphopeptides, biased toward the 
phosphorylation motifs for cyclin-dependent kinases and MAP kinases and toward 
the epitope of the monoclonal antibody MPM-2, and immobilized on Streptavidin 
beads. This library and its unphosphorylated counterpart were screened against 
680 pools of in vitro translated 35 S-Met labeled proteins. pT denotes 
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phosphothreonine. B represents a biased mixture of the amino acids P, L, I, V, F, 
M, W. Figure IB is a set of four SDS-PAGE/autoradiographs. The WW-domain 
containing protein Pinl and a fragment of the mitotic kinase Plk-1, denoted by 
asterisks, were isolated from two pools as clones that associated preferentially 
5 with the phosphorylated form of the immobilized peptide library. In each panel, 
the first lane shows 10% of the input radiolabeled protein pool, while the second 
and third lanes show binding of proteins within this pool to the phosphorylated 
and unphosphorylated immobilized libraries, respectively. Identification of Pinl 
and Plkl occurred through progressive subdivision of their respective pools to 

10 single clones (panels on right). Arrowheads indicate partial translation or 
proteolytic breakdown products of Plkl that exhibit more dramatic phospho- 
discrimination than the full-length transcript of the isolated Plkl fragment, 
suggesting that the full-length transcript likely contains a smaller discrete 
phospho-binding domain. 

15 Figure 2 A is a schematic diagram showing various C-terminal truncations 

of Plk-1, translated in vitro, and assayed for selective binding to the 
phosphorylated peptide library of Figure 1 A over its unphosphorylated 
counterpart. The two shaded regions in the C-terminus of Plk-1 correspond to its 
polo boxes (PB1 and PB2) as defined by Pfam. Truncated constructs were 

20 designed according to boundaries of sequence homology within the polo-like 
kinase family rather than boundaries of the Pfam-delineated polo boxes. Clone 
407-C6 is the fragment of Plk-1 isolated from the screen depicted in Figures 1A 
andB. 

Figure 2B shows an amino acid sequence alignment of the C-terminal 
25 noncatalytic region of human Plk- 1 , Xenopus Plx- 1 , and Drosophila Polo. Bold 
lines indicate the designated polo boxes (PB1 and PB2) of Plk-1 as defined by 
Pfam. 

Figures 3A-3D are histograms showing the binding ratios of the Plk-1 polo- 
box domain (PBD). The Polo-box Domain (PBD, residues 326-603) of Plk-1 was 
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expressed as a GST fusion protein, immobilized on Glutathione-agarose beads, 
and incubated with phosphothreonine/serine-oriented degenerate peptide libraries 
consisting of the sequences MAXXXXpTPXXXXAKK (SEQ ID NO:l 1) (3 A), 
MAXXXXpSPXXXXAKK (SEQ ID NO: 12) (3B), MAXXXXSpTXXXXAKK 
5 (SEQ ID NO: 13) (3C), or MAXXXXSpSXXXXAKK (SEQ ID NO: 14) (3D) 
where X indicates all amino acids except Cys. Following extensive washing, 
bound peptides were eluted and sequenced. The bar graphs show the relative 
abundance of each amino acid at a given cycle of sequencing compared to its 
abundance in the starting peptide library mixture. The Plk-1 PBD selects for 

10 serine in the pThr/Ser-1 position strongly (5.9 or 8.1) and for proline in the 
pThr/Ser+1 position moderately (1.6 or 1.8). 

Figure 3E is an autoradiograph. Pinl (3E) shows an absolute requirement 
for proline in the pThr+1 position, whereas the PBD of Plk-1 does not. Full-length 
Pinl and the PBD (residues 326-603) of Plk-1 were translated in vitro in the 

15 presence of S-methionine and tested for binding to four immobilized peptide 
libraries that differed by phosphorylation status and/or the presence of proline in 
the pThr+1 position. 

pTP- biotin-ZGZGG AXXBXpTPXXXXAKKK (SEQ ID NO: 1 5), 
TP= biotin-ZGZGGAXXBXTPXXXXAKKK (SEQ ID NO: 16), 
20 pT= biotin-ZGZGGAXXXXpTXXXXXAKKK (SEQ ID NO: 1 7), 
T- biotin-ZGZGGAXXXXTXXXXXAKKK (SEQ ID NO: 18), 
where pT is phosphothreonine, Z indicates aminohexanoic acid, X denotes all 
amino acids except Cys, and B is a biased mixture of the amino acids P, L, I, V, F, 
M, W. 

25 Figure 4A shows isothermal titration calorimetry results. These results 

show that Plkl PBD binds its optimal phosphopeptide ligand with high affinity 
and high specificity. 

Figure 4B is a table. Isothermal titration calorimetry (ITC) was used to 
determine binding constants (Kd) for the association of the Plk-1 PBD (residues 
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326-603) with its optimal phosphopeptide ligand and with nine mutated versions 
of this peptide. All observed binding stoichiometrics were consistent with a 1:1 
complex of PBD and phosphopeptide. N.D.B indicates no detectable binding by 
ITC for a Plk-1 PBD concentration of at least 150|nM. pT, pS, and pY denote 
5 phosphothreonine, phosphoserine, and phosphotyrosine, respectively. 

Figures 5 A upper panel shows a FACS (fluorescence activated cell sorter) 
trace of human cells used in the pull-down assays shown below. The upper left 
panel shows the FACS profile of the cells arrested with aphidocolin in Gl (so the 
total DNA content is IN where N = the normal amount of DNA in a diploid 

10 human cell) and verifies that the cells were in Gl . The right trace shows the 
FACS profile of the cells arrested with nocadozole to trap them in G2/M, and 
shows that their DNA content is 2N, verifying that they are arrested in G2/M. 
Figures 5 A (lower panel) and 5B are immunoblots showing that the Plk-1 PBD 
associates with mitotic phosphoproteins in HeLa cells. Lysates from HeLa cells, 

15 arrested at interphase with aphidicolin or in G2/M with nocodazole, were 
incubated with GST, GST-Pin 1 5 and the GST-Plk-1 PBD (residues 326-603; 
Figure 5A). Mitotic phosphoproteins co-precipitated with these GST fusions were 
detected by blotting with the pSer-Pro specific monoclonal antibody MPM-2. 
Interaction of the GST-Plk-1 PBD (residues 326-603) with mitotic 

20 phosphoproteins from nocodazole-arrested HeLa cells was disrupted by pre- 
incubation of GST-Plk-1 PBD with its optimal phosphopeptide ligand, 
MAGPMQ-S-pT-P-LNGAKK (SEQ ID NO: 19) (PoloBoxtide-optimal), but not 
with an unphosphorylated equivalent peptide, MAGPMQ-S-T-P-LNGAKK (SEQ 
ID NO:20) (PoloBoxtide-8T), nor a phosphopeptide whose serine at pThr-1 was 

25 mutated to valine (PoloBoxtide-7V; Figure 5B). 

Figures 6A, 6C, and 6D are immunoblots showing that Plk-1 PBD interacts 
with Thr| 30 of mitosis-dependent phosphorylated Cdc25C from HeLa cells. Figure 
6A is an anti-CDC25 western blot on lysates from HeLa cells arrested in 
interphase with aphidicolin or in G2/M with nocodazole, incubated with a GST 
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fusion of the Plk-1 PBD (residues 326-603). Endogenous Cdc25C from mitotic 
lysates was precipitated with GST-Plk-1 PBD and detected by anti-Cdc25C (Santa 
Cruz Biotechnology). Interaction of GST-Plk-1 PBD with Cdc25C was disrupted 
as in Figure 5B by pre-incubation of GST-Plk-1 PBD with its optimal 
5 phosphopeptide ligand (PoloBoxtide-optimal) but not with the PoloBoxtides-8T or 
-7V. Figure 6B is a sequence alignment showing that a consensus motif for the 
Polo-box Domain of Plk-1 is conserved between human and Xenopus Cdc25C. 
T130 and T138 of human and Xenopus Cdc25C, respectively, are known to be 
phosphorylated during mitosis (Figure 6B). Lysates were prepared from HeLa 

10 cells transfected with either wild type, T130A, or S129V HA-Cdc25C (human), 
arrested in G2/M with nocodazole, and normalized for equal loading of the 
mitotically up-shifted form. Interaction of GST-Plk-1 PBD (residues 326-603) 
with mitotically phosphorylated Cdc25C from these lysates was detected by pull- 
down with glutathione beads, separation by 1 1 .4% SDS-PAGE and anti-HA 

15 blotting (Figure 6C). Figure 6D shows lysates, analyzed by 9% SDS-PAGE to 
enhance separation of the hyper-phosphorylated (P) form of Cdc25C from 
partially phosphorylated and unphosphorylated (U) forms. 

Figure 7A is a set of micrographs visualized using fluorescence 
microscopy. Figure 7B is a histogram showing the ratio of centrosomal 

20 localization by the GST-PBD relative to centrosomal y-tubulin. U20S cells were 
arrested in G2/M with nocodazole and then incubated with 4 |iM GST-Plk-1 PBD 
(residues 326-603) in cell permeabilization buffer containing 1 U /ml Streptolysin- 
O in the presence of no peptide (upper panel), 250 |iM of the optimal 
phosphopeptide (optimal, middle panel), or 250 jiM of the corresponding 

25 unphosphorylated analogue (8T, lower panel). Following incubation, the cells 
were washed extensively, fixed with paraformaldehyde, extracted with Triton X- 
100, immunostained for GST and y-tubulin, and counterstained with DAPI to 
visualize the nucleus. Overlap of the GST (Alexa Fluor 488) and y-tubulin (Texas 
Red) signals is shown in the merged figure in the far right column (Figure 7A). 
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The ratio of centrosomal localization by the GST-PBD relative to centrosomal y- 
tubulin levels is shown in Figure 7B. 

Figure 8 is a schematic diagram showing a model for 2-step activation of 
Cdc25 and Cdc2/Cyclin B auto-activation through Plk-1. Phosphorylation of a 
5 few molecules of Cdc25, either by a small amount of de-repressed Cdc2/Cyclin B 
or another proline-directed kinase early in mitosis, primes those Cdc25 molecules 
for binding of Plk-1 through its PBD. Activation of the Plk-1 kinase domain by 
Plkkl generates the first wave of Cdc25 activation, dephosphorylating more 
Cdc2/Cyclin B, which, in turn, phosphorylates additional Cdc25 molecules for 

10 interaction with the Plk-1 PBD. The net result is a positive feedback loop for 
Cdc2/Cyclin B activation (circled). 

Figure 9A is a table showing the conservative mutations at the pT-1 serine 
that abolish Plkl PBD / peptide binding in solution. Isothermal titration 
calorimetry was used to determine binding affinities. The Plkl PBD (residues 

15 326-603) was expressed in Exoli as a GST fusion, purified on glutathione agarose, 
proteolytically digested from GST, and further purified by anion exchange 
chromatography. N.D.B. indicates no detectable binding for a Plkl PBD 
concentration of at least 150 jiM. pT denotes phosphothreonine. Throughout 
Figures 9A and 9B, the domains are depicted as follows: kinase: white; PC: gray; 

20 PB1: red; PB2: blue; 

i 

Figure 9B is a filter array that shows binding of GST-Plkl PBD (residues 
326-603) to peptide spots, comprising single point mutants of the Plkl PBD 
optimal phosphopeptide (right column). Bound GST-Plkl was detected by 
blotting with HRP-conjugated anti-GST antibody. 
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Figure 1 OA is a schematic diagram showing the boundaries of the PBD by 
limited proteolysis. Domain architecture of full-length Plkl and stable fragments 
(left) are shown together with the time-course of V8 protease digestion (right). 
Molecular weight and amino acid boundaries of the limiting domain were 
5 determined by mass spectroscopy. 

Figure 1 OB is a schematic diagram showing the Polo-box 1 and Polo-box 2 
p 6 a structures, colored as in (A), are shown superimposed. 

Figure IOC is a RIBBONS representation (Carson, 1991) of the structure of 
the Plkl PBD in complex with a phosphothreonine-containing peptide shown as a 
10 ball and stick representation in yellow. The Polo-boxes and Polo-cap region are 
colored as in (A). The phosphopeptide binds at one end of a pocket formed 
between the two polo boxes. 

Figure 1 1 A shows a structure-based sequence alignment of the Polo-box 
Domain family. Residues with 100% conservation are shaded purple while highly 
1 5 conserved residues are shaded cyan. 

Figure 1 IB is an image of the molecular surface of the PBD based on the 
structure determined by X-ray crystallography. The surface positions 
corresponding to the conserved residues are colored as in Figure 1 1 A. The most 
highly conserved residues within the Plkl PBD are located exclusively on the 
20 peptide-binding face of the PBD. The most highly conserved residues within the 
Plkl PBD are located exclusively on the peptide binding face of the PBD. The 
coloring scheme is as in 1 1 A. 

Figure 1 1C is a schematic diagram depicting the electrostatic potential of 
the PBD phosphopeptide pocket, calculated using GRASP (Nicholls et al., 1991), 
25 with the phosphopeptide superimposed in stick representation (oxygen atoms, red; 
nitrogen atoms, blue). Negative potential of the PBD surface is colored red and 
positive potential blue. 

Figure 1 ID is a schematic representation of the interactions between the 
phosphopeptide (blue) and the Plkl PBD. Hydrogen bonds, van der Waals 
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interactions, and water molecules are denoted by dotted lines, purple crescents, 
and green circles, respectively. 

Figure 1 IE is a schematic representation of direct and indirect hydrogen 
bonds (dotted lines) between the phosphate and the Plkl PBD. Hydrogen bond 
5 lengths are given in angstroms. 

Figure 12A is a schematic diagram showing a comparison of the p- 
sandwich folds of the Plkl PBD and the Sak polo-box dimer. Tertiary structures 
are shown on the top together with secondary structure topology (triangles, p 
strands; rectangles, a-helices) on the bottom. PB1 and PB2 of Plkl are denoted 

10 by red and purple colors, respectively, while the Pc of Plkl is shown in green. 

Polo-boxes from separate Sak molecules within the dimer are likewise denoted by 
red and purple. The Sak p sandwich involves strand swapping between separate 
polo-boxes within the dimer. 

Figure 12B is a sequence alignment of the Polo-boxes from Plkl and Sak. 

15 Plkl has a p6cc secondary topology while Sak has a circularly altered p5ocp 

topology, p-sheet and a-helix notation follows PB1; the corresponding elements 
for PB2 are (37 through pi2 and ocC. A conserved salt-bridging interaction 
initially observed in the Sak structural analysis (Leung et al., Nat Struct Biol. 
9:719-724, 2002) is shown by the blue bracket. Conserved non-polar residues are 

20 highlighted in blue and residues conserved between Sak and at least one of the 
Plkl PBDs are boxed. 

Figure 13 A is an autoradiograph. Wild type and mutant Plkl PBD 
(residues 326-603) were translated in vitro in the presence of 35 S-methionine and 
examined for binding to an immobilized pThr-Pro-oriented library and its 

25 unphosphorylated counterpart. pTP= biotin-ZGZGGAXXBXpTPXXXXAKKK 
SEQ ID NO:21, TP- biotin-ZGZGGAXXBXTPXXXXAKKK SEQ ID NO:22, 
where pT is phosphothreonine, Z is aminohexanoic acid, X is all amino acids 
except Cys, and B denotes a biased mixture of the amino acids P, L, I, V, F, M, W. 



Figure 13B is a diagram showing isothermal titration calorimetry results. A 
H538A/K540M mutation of the Plkl PBD abolishes binding to its optimal 
phosphopeptide as measured by isothermal titration calorimetry. 

Figure 13C is a Western blot showing that mutation of the H538/K540 
5 pincer disrupts interaction of the isolated Plkl PBD with Cdc25 in vivo. HeLa 
cells were transfected with wild type and mutant versions of a His-Xpress-tagged 
Plkl PBD construct (residues 326-603) or with a control PlklPBD construct 
lacking the second Polo-box (residues 326-506) and arrested in G2/M with 
nocodazole. The Plk PBD was pulled down with Ni 2+ beads and bound 

10 endogenous proteins analysed by SDS-PAGE and blotted for Cdc25. 

Figure 1 3D is a Western blot showing that mutation of the H538/K540 
pincer in the Plkl PBD disrupts interaction of full-length Plkl with Cdc25 in vivo. 
HeLa cells were transfected with wild type and mutant versions of full-length 
myc-tagged Plkl and arrested in G2/M with nocodazole. Plk-myc was 

15 immunoprecipitated with anti-myc-conjugated beads and Cdc25 binding to Plkl 
analyzed as in 13C. 

Figure 14 is a series of photomicrographs showing that mutation of the 
H538/K540 pincer sequence abolishes centrosomal localization of the Plkl PBD 
in HeLa Cells. U20S cells were arrested in G2/M with nocodazole and then 

20 incubated with 4 wild-type or mutant GST-Plkl PBD (residues 326-603) in 
cell permeabilization buffer containing 1 U/ml Streptolysin-O. Following 
incubation, the cells were washed extensively, fixed with paraformaldehyde, 
extracted with Triton X-100, immunostained for GST and y-tubulin, and 
counterstained with DAPI to visualize the nucleus. Overlap of the GST (Alexa 

25 Fluor 488) and y-tubulin (Texas Red) signals is shown in the merged figure in the 
far right column. 

Figure 15 is a series of diagrams showing the results of FACS analysis. 
HeLa cells were transfected with wild type and mutant GFP-tagged Plkl (residues 
326-603) for 32 hours. Cells were harvested, stained with Hoechst 33342, and 
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analyzed by FACS to determine DNA content in the total cell populations (left 
panels). Similar analysis limited to the transfected cell population was performed 
by gating only on the GFP expressing cells (right panels). G2/M population 
percentages are averages from three independent experiments. 
5 Figure 16A is a Western blot that phosphopeptide binding by full-length 

Plkl is reduced relative to that for the isolated Plkl PBD. Approximately 10% of 
input full length Plkl (residues 1-603) interacted with an immobilized pThr-Pro 
oriented library with slight preference over the unphosphorylated library analogue. 
The phosphorylation-dependent component of binding arose from the PBD, as it 

10 was eliminated by mutation of the His538/K540M pincer. In contrast, 

phosphopeptide binding by the isolated PBD (Figure 13 A) was 10-fold greater and 
considerably more phospho-dependent. 

Figure 16B is a graph showing that the optimal PBD phosphopeptide 
stimulates full-length Plkl kinase activity. GST-Plkl (prepared in SF9 cells) was 

15 preincubated without peptide (closed circles), with 250 jiM of the optimal PBD 
phosphopeptide (open squares) or with 250 jiM of the non-phosphorylated optimal 
peptide counterpart (closed squares) for 5 minutes at room temperature prior to 
initiating the kinase reaction by addition of ATP. [32P]-incorporation into casein 
was determined by SDS-PAGE electrophoresis, autoradiography, and 

20 densitometry. Preincubation with the optimal PBD phosphopeptide ligand 
enhanced the rate of casein phosphorylation by Plkl by a factor of 2.6 as 
determined from three independent experiments. 

Figure 16C is a schematic diagram depicting a model for Plkl regulation by 
the PBD. PB1 and PB2 are shaded orange, kinase domain cyan, phosphopeptide 

25 purple with phosphate in red. Inhibitory interactions between the PBD and the 
kinase domain in the basal state (left) are relieved by phosphopeptide binding, 
which may also stabilize association of the two Polo-boxes (right). 

Figure 17A is an autoradiograph showing the identification of 
phosphoSer/Thr-binding domains using an ATM/ATR-motif library. 
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An oriented (pSer/pThr) phosphopeptide library, biased toward the 
phosphorylation motifs for ATM/ATR kinases, was immobilized on Streptavidin 
beads. This phosphopeptide library [pSQ= biotin- 
ZGZGGAXXXB(pS/pT)QJXXXAKKK (SEQ ID NO:23)] and its non- 
5 phosphorylated counterpart were screened against in vitro translated 35 S-Met 

labeled proteins. (pS/pT) denotes 50% phosphoserine and 50% phosphothreonine; 
Z indicates aminohexanoic acid; B represents a biased mixture of the amino acids 
A, I, L, M, N 5 P, S 5 T, V; and J represents a biased mixture of 25% E, 75% X, 
where X denotes all amino acids except Arg, Cys, His, and Lys. PTIP, denoted by 

10 arrow, was isolated from pool EE1 1 as a clone that associated preferentially with 
the phosphorylated form of the immobilized peptide library. In each panel, the 
first and second lanes show binding of proteins within the pool to the 
phosphorylated and non-phosphorylated libraries, respectively. Identification of 
PTIP occurred through progressive subdivision of the EE1 1 pool to a single clone 

15 (panel on right denoted by asterisk). Longer exposures revealed partial translation 
or proteolytic breakdown products of PTIP that also exhibit phospho- 
discrimination, suggesting that the full-length transcript likely contains a smaller 
discrete phospho-binding domain. The uppermost band is a fusion artifact of PTIP 
with vector sequences resulting from translation initiation at an upstream ATG in 

20 the vector. 

Figure 1 7B is an autoradiograph showing deletion mapping of the phospho- 
binding domain of PTIP. Truncations of PTIP were translated in vitro and assayed 
for selective binding to the phosphorylated peptide library as in Figure 17 A. 
Shaded regions in the C-terminus of PTIP correspond to its BRCT domains. 
25 Truncation constructs were designed according to boundaries of sequence 

homology within the BRCT domain, boundaries from sequence alignments, and 
from the Pfam-delineated BRCT domains (Bateman et al. 9 Nucleic Acids Res 27: 
260-2, 1999). 
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Figure 18A is an autoradiograph. PTIP, BRCA1, MDC1, 53BP1 and Rad9 
tandem BRCT domains were translated in vitro in the presence of 35S-methionine 
and tested for binding to immobilized phosphopeptide and non-phosphopeptide 
libraries as described in Figure 17A. The peptide libraries used were pSQ as 
5 defined in Figure 1 7 A. pS= biotin-ZGZGGAXXXXpSXXXXXAKKK SEQ ID 
NO:24; P T=biotin-ZGZGGAXXXXpTXXXXXAKKK SEQ ID NO:25, where pS 
is phosphoserine, pT is phosphothreonine, Z indicates aminohexanoic acid, and X 
denotes all amino acids except Cys. Both PTIP and BRCA1 tandem BRCT 
domains display stronger binding to the pSQ and pS libraries as compared to the 

10 non-phospho libraries. Domain boundaries: PTIP as indicated in Figure 1 (SEQ 
ID NO:26); BRCT1 and 2: amino acids 1634-1863 of SEQ ID NO:27; BRCT1 
alone: amino acids 1634-1751 of SEQ ID NO: 27; BRCT2 alone: 1725-1863 of 
SEQ ID NO: 27; MDC1: amino acids 1880-2089 of SEQ ID NO: 28 
(NP_055456.1); 53BP1: amino acids 1700-1972 of SEQ ID NO: 29 

15 (NP_005648.1); Rad9: amino acids 1025-1309 of SEQ ID NO:30 (NP_0 10503.1). 

Figures 18B and C are autoradiographs showing that the PTIP and BRCA1 
BRCT domains show strong selection for Phe at the (pSer/pThr)Gln +3 position 
(7.0 or 7.5), respectively. Tandem BRCT domains of PTIP and BRCA1 were 
immobilized as glutathione-S-transferase (GST) fusion proteins on glutathione 

20 beads and incubated with non-biotinylated versions of the oriented degenerate 
phosphopeptide libraries described in Figure 1 7 A. Following extensive washing, 
bound peptides were eluted and sequenced. Bar graphs show the relative 
abundance of each amino acid at a given cycle of sequencing compared to its 
abundance in the starting peptide library mixture, as described (Yaffe et al., 

25 Methods Enzymol 328:157-70, 2000). 

Figures 18D,18E, 18F, and 18G show binding of GST-PTIP and BRCA1 
tandem BRCT domains to a filter array of peptide spots, comprising single point 
mutants of the optimal BRCT domain phosphopeptide (left column). Bound GST- 
BRCT domains were detected by blotting with HRP-conjugated anti-GST 
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antibody. The resulting consensus binding motif is indicated in the right column; 
X denotes no dominant selection, (J) denotes residues with aliphatic or aromatic 
side chains, and letters enclosed in square brackets are specifically de-selected. 
The top row indicates the amino acid that was substituted for the optimal amino 
5 acid. Substitution of pSer for pThr enhanced binding for both PTIP and BRCA1 
BRCT domains, consistent with the ITC results. Substitution of pTyr for pThr 
eliminated binding altogether, verifying that tandem BRCT domains are 
pSer/pThr-specific binding modules. Replacement of pThr with Thr, Ser or Tyr 
abrogated tandem BRCT domain binding. The pTQ oriented blots on the left 

10 show strong selection at several positions for both PTIP and BRCA1 BRCT 
domains; especially for Phe in the +3 position in agreement with the oriented 
peptide library screening data. The pS oriented blots on the right show that the +3 
position is the most important position for peptide selection. 

Figure 19A is a Western blot. Lysates from U20S cells were obtained 

15 prior to and 2 hours after the cells were exposed to 10 Gy of ionizing radiation 
(IR). The lysates were incubated with GST-PTIP tandem BRCT domains, and 
bound proteins were detected by blotting with the anti-ATM/ ATR phosphoepitope 
motif antibody. Interaction of the PTIP BRCT domains with these 
phosphoproteins from IR treated cells was disrupted by pre-incubation with the 

20 pSQ peptide library, but not with the SQ peptide library or the pTP library. 

Figure 19B is a Western blot showing that the interaction of the PTIP 
BRCT domains with DNA damage induced phosphoproteins from IR treated 
U20S cells was disrupted by pre-treating the cells with caffeine (25 mM) prior to 
IR exposure or by pre-incubating the beads with an optimal BRCT-binding 

25 peptide (BRCTtide-opt), but not by preincubating the beads with the peptide's 
non-phosphorylated counterpart (BRCTtide-7T). 

Figure 19C is a Western blot showing that tandem BRCT domains of PTIP 
interact with 53BP1 following DNA damage. Endogenous 53BP1 from IR treated 
U20S cells was precipitated with GST-PTIP tandem BRCT domains and detected 
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by incubating with an anti-53BPl antibody. Interaction of GST-PTIP tandem 
BRCT domains with HA-tagged 53BP1, was then detected by anti-HA blotting. 
This interaction was abolished by treating the lysates with lambda phosphatase, by 
pre-incubating the beads with an optimal BRCT-binding peptide (BRCTtide-opt), 
5 but not with its non-phosphorylated counterpart (BRCTtide-7T), or by 

preincubating the beads with the pSQ library, but not by preincubating with the 
SQ library or the pTP library. Treatment of the cells with 25 mM caffeine also 
disrupted the interaction. 

Figure 19D is a Western blot. Lysates from U20S cells 2 hours following 

10 IR were incubated with GST-BRCA1 tandem BRCT domains. DNA damage- 
induced phosphoproteins were detected by blotting with the anti-ATM/ ATR 
phosphoepitope motif antibody. The interaction of the GST-BRCA1 tandem 
BRCT domains with the phosphoproteins were disrupted as in panel B. These 
results show that tandem PTIP and BRCA1 BRCT domains associate with DNA 

1 5 damage-induced phosphoproteins through their phosphopeptide-binding pockets. 
Figures 20A-C are photomicrographs showing immunofluorescence in 
U20S cells demonstrating that full length PTIP forms DNA damage induced foci 
and co-localizes with (pSer/pThr)-Gln proteins, 53BP1, and y-H2AX. Figure 20 A 
shows U20S cells transfected with a full length PTIP-GFP construct (PTIP-FL 

20 residues 1-757). Figure 20B shows U20S cells transfected with a PTIP deletion 
construct in which the last two BRCT domains were removed (PTIP-ABRCT, 
residues 1-550). Figure 20C shows U20S cells transfected with a PTIP construct 
containing only the last two BRCT domains (BRCT) 2 , residues 550-757). In 
Figures 20A-20C, 24 hours following transfection cells were either treated with 10 

25 Gy of ionizing radiation or mock irradiated, allowed to recover for 2 hours, 
stained, and analyzed by immunofluorescence microscopy. 

Figures 21 A and B are photomicrographs showing immunofluorescence in 
U20S cells demonstrating that caffeine attenuates recruitment of PTIP to DNA 
damage foci in response to ionizing radiation. U20S cells transfected with full- 
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length PTIP-GFP cDNA were mock treated or pretreated with lOmM caffeine for 
70 minutes before exposure to lOGy ionizing radiation. (A) In reponse to IR, 
mock-treated U20S cells formed nuclear foci containing PTIP (in green) and 
H2AXp (in red); these two proteins co-localize at sites of DNA damage (merge). 
(B) In response to IR, caffeine treated U20S cells formed reduced numbers of 
nuclear foci; PTIP was mislocalized and did not form discrete nuclear foci (in 
green) and there were reduced numbers of H2AXp (in red) containing foci; 
pretreatment with caffeine effectively abolished co-localization of PTIP and 
H2AXp (merge). 



Figure 


22 


shows 


the 


PTIP amino acid sequence. 


Figure 


23 


shows 


the 


PTIP nucleic acid sequence. 


Figure 


24 


shows 


the 


BRCA1 amino acid sequence. 


Figure 


25 


shows 


the 


BRCA1 nucleic acid sequence. 


Figure 


26 


shows 


the 


MDC1 amino acid sequence. 


Figure 


27 


shows 


the 


MDC1 nucleic acid sequence. 


Figure 


28 


shows 


the 


53BP1 amino acid sequence. 


Figure 


29 


shows 


the 


53BP1 nucleic acid sequence. 


Figure 


30 


shows 


the 


Rad9 amino acid sequence. 


Figure 


31 


shows 


the 


Rad9 nucleic acid sequence. 
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Description of the Invention 
The present invention features a method for identifying kinase targets, an 
exemplary kinase target, the Polo box domain of the Polo-like kinase, and 
exemplary peptide mimetics that interfere with signaling by the Polo-like kinase. 
5 We have developed a proteomic approach that allows us to identify 

virtually any peptide-binding domain by simultaneously screening a polypeptide 
expression library with a biased peptide library. We have used this method to 
identify, for example, targets downstream of kinases in signaling pathways. 
This strategy involves using an immobilized library of partially degenerate 

10 phosphopeptides, biased toward a kinase phosphorylation motif, to isolate 
interacting effector proteins targeted by substrates of that kinase. Using this 
approach for cyclin-dependent kinases, we identified the Polo-box Domain (PBD) 
of the mitotic kinase Plk-1 as a phosphoserine/threonine binding domain. Polo- 
like kinases (Plks) perform crucial functions in cell-cycle progression and multiple 

1 5 stages of mitosis. Plks are characterized by the presence of a C-terminal non- 
catalytic region containing two tandem Polo-boxes, termed the Polo-box domain 
(PBD). 

In addition, we have discovered that the PBDs of human, Xenopus, and 
yeast Plks all recognize similar phosphoserine/threonine-containing motifs. The 

20 1.9 A X-ray structure of a human Plkl PBD-phosphopeptide complex shows that 
the Polo-boxes p 6 ot structures. They associate to form a novel 12-stranded p- 
sandwich domain, to which the phosphopeptide-binds within a conserved, 
positively-charged cleft located at the edge of the Polo-box interface. Mutations 
designed to specifically disrupt phosphodependent interactions abolish cell-cycle 

25 dependent localization and provide compelling phenotypic evidence that PBD- 
phospholigand binding is necessary for proper mitotic progression. In addition, 
phosphopeptide-binding to the PBD stimulates kinase activity in full-length Plkl, 
suggesting a conformational switching mechanism for Plk regulation and a dual 
functionality for the PBD. Together, our data reveal a central role for PBD- 
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phosphoprotein interactions in many, if not all, cellular functions of Plks. This 
finding provides a structural explanation for how Plk-1 localizes to specific sites 
within cells in response to Cdk phosphorylation at those sites. 

Activation of signaling cascades in eukaryotic cells involves the directed 
5 assembly of protein-protein complexes at specific locations within the cell. This 
process is controlled by protein phosphorylation on serine, threonine and/or 
tyrosine residues that directly or indirectly regulate protein-protein interactions, 
often through the actions of modular binding domains. Historically, studies of 
phospho-binding domains have focused on SH2 and PTB domains, which bind to 

10 specific phosphotyrosine-containing sequence motifs. Until recently, it was 

thought that phosphorylation of proteins on serine and threonine residues was not 
responsible for direct interactions with modular binding domains but instead 
induced conformational changes to regulate function. However, a number of 
domains (14-3-3 proteins, FHA domains, WD40 repeats of F-box proteins, MH2 

15 domains and the WW domain of the prolyl isomerase Pinl) have been identified 
that bind directly to short phosphoserine or phosphothreonine-containing 
sequences to control cell cycle progression, coordinate the response to DNA 
damage, and regulate apoptosis. 

The vast majority of intracellular proteins are phosphorylated on serine or 

20 threonine residues at some point during their lifetime. Furthermore, known 
phosphoserine/threonine binding domains comprise a diverse structural group, 
demonstrating that many divergent tertiary folds have acquired a phospho- 
dependent binding function through evolution. Approximately one-third of the 
modular protein domains identified by Pfam and SMART on the basis of sequence 

25 homology have no known function. Our technique enables the identification of 
additional phosphopeptide binding modules that target serine/threonine residues. 

2x2 Biased Library Screening 

To design a general proteomic screen capable of identifying novel 
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phosphoserine/threonine binding modules, we took advantage of the observation 
that protein kinases and phosphopeptide binding domains seem to have co-evolved 
to recognize overlapping sequence motifs (Yaffe et al., Nat. Biotechnol 19:348- 
353, 2001; Obata et al., Biol Chem. 275:36108-361 15, 2000). For example, the 
5 basophilic protein kinase, Akt, phosphorylates substrates at sites that contain the 
core motif RXRSX[S/T] and 14-3-3 proteins bind to a subset of these 
phosphorylated sites that have the optimal motif RSX[pS/pT]XP. Cyclin- 
dependent kinases (Cdks) phosphorylate substrates at [S/T]PXR motifs, and the 
WW domain of the proline isomerase Pinl recognizes the phosphorylated forms of 

10 these [pS/pT]P sites to mediate isomerization of the proline residue. Importantly, 
this apparent overlap between kinase and phospho-binding motifs is not perfect. 
Instead, limited overlap allows combinatorial interactions between substrates of 
particular kinases and downstream binding modules. 

Our motif-based strategy for identifying pSer/Thr-binding domains 

15 involved biasing a library of partially degenerate phosphopeptides towards the 
phosphorylation motif of a kinase and then using an immobilized form of this 
library as bait in a screen for interacting proteins translated in vitro from a cDNA 
library. 

Using a library of phosphopeptides biased towards motifs phosphorylated 
20 by cyclin-dependent kinases (Cdks), we identified the C-terminal Polo-box 
containing region of the human Polo-like kinase, Plk-1, as a specific 
phosphopeptide recognition module. It has been previously shown that this non- 
catalytic region is critical both for Polo kinase subcellular localization and for 
proper mitotic progression in yeast and human cells. Our findings provide the first 
25 description of a biochemical mechanism through which Plk-1 performs these 

essential mitotic functions. Furthermore, the identification of the conserved Plk-1 
PBD as the latest member of the growing superfamily of pSer/Thr-binding 
domains suggests that phospho-specific docking may be a general mechanism for 
Ser/Thr kinase signaling in eukaryotic biology. 
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To identify pSer/Thr-binding domains involved in cell cycle regulation, we 
designed a pThr-Pro-oriented peptide library biased to resemble the motif that 
would be generated by the action of cyclin-dependent kinases and MAP kinases, 
as well as that recognized by the mitotic phosphoprotein-specific monoclonal 
5 antibody MPM-2, whose pSer/Thr-binding motif we had determined previously 
(Yaffe et al., Science 278:1957-1960, 1997). The library was constructed with a 
flexible linker and an N-terminal biotin tag, allowing an immobilized form of this 
library to be used as bait in an interaction screen against a library of proteins 
produced by in vitro expression cloning (Lustig et. al., Methods Enzymol 283:83- 

10 99, 1997; Figure 1A). 

This library vs. library screening approach is the reverse of a traditional 
peptide library screen in which a single purified domain is assayed against a 
degenerate peptide library to reveal the optimal binding motif. In the approach 
presented here, a degenerate but motif-biased peptide library is used to screen for 

15 novel binding domains. By using a collection of peptides biased towards the motif 
of a protein kinase superfamily, the screen casts a larger net than would be 
possible if only a single peptide were used as bait. To control for phospho- 
independent peptide binding, an identical library was constructed with Thr 
substituted for the fixed pThr residue (Figure 1 A). 

20 The pThr-Pro-oriented peptide library, and its non-phosphorylated Thr-Pro 

library counterpart were immobilized on Streptavidin beads and screened in 
parallel against 680 individual pools of in vitro translated [ 35 S]-labeled proteins. 
Each pool contains -30 radiolabeled proteins/pool that are detectable by SDS- 
PAGE/autoradiography (Figure IB, "pool" lanes). As shown in Figure IB, 

25 proteins produced by in vitro translation often failed to bind either library at all or 
bound more strongly to the non-phosphorylated peptide library-containing beads. 
However, we identified 7 distinct pools containing radiolabeled translation 
products that bound preferentially to the pThr-Pro library compared with the Thr- 
Pro library (asterisks in Figure IB). 
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Plasmid pools containing these positively scoring hits were progressively 
subdivided and re-screened for phospho-binding until individual clones were 
isolated and sequenced. Of the 7 positive clones, 3 were successfully recovered, 
two of which are reported here. One of the clones, 109-B7, was found to encode . 
5 the prolyl isomerase Pinl, which is known to bind and isomerize pThr-Pro motifs 
recognized by the monoclonal antibody MPM-2. Its isolation, therefore, validated 
the feasibility of our screening approach. 

A second positively scoring hit, clone 407-C6, was found to encode the C- 
terminal 80% of the mitotic kinase Plk-1 (polo-like kinase- 1, amino acids 95-603). 

10 This clone was missing critical components of the Plk-1 kinase domain, including 
the glycine rich loop (amino acids 60-66) and the invariant lysine (K82), implying 
that phosphopeptide binding was independent of Plk-1 kinase activity. Phospho- 
specific binding by the full-length transcript of this incomplete Plk-1 clone was 
less pronounced than binding by Pinl (Figure IB). Partial translation products or 

15 proteolytic breakdown fragments arising from this clone (Figure IB, arrowheads) 
showed strong discrimination for the phosphorylated peptide library, suggesting 
that these fragments included a functional phosphopeptide binding domain. 

Identification of Polo-Box Domain as a Phosphopeptide Recognition Module 
20 A hallmark feature of the Polo kinase family is the presence of a highly 

conserved C-terminal region downstream from a conserved amino-terminal kinase 
domain (Figures 2A and B). This region includes two blocks of strong homology, 
termed Polo Boxes. To define the limiting fragment of Plk-1 responsible for 
phosphospecific binding, we generated a series of deletion constructs based on an 
25 alignment of the C-terminal regions of human Plk-1, Xenopus Plx-1 and 
Drosophila Polo (Figure 2B), and analyzed these deletion fragments for 
phosphopeptide-specific binding. As shown in Figure 2A, a construct that began 
immediately after the kinase domain and extended to the last residue of the protein 
(residues 326-603) demonstrated strong and specific binding to the 
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phosphothreonine-proline peptide library compared with the non-phosphorylated 
control. Notably, this construct was superior to the parent clone 407-C6 in 
discriminating for phosphopeptides. Neither of the individual Polo Boxes alone 
(denoted PB1 and PB2), nor a construct containing both Polo Boxes but lacking 
5 the linker region between the kinase domain and PB1, was capable of 

phosphopeptide binding (Figure 2A). Furthermore, a construct that included the 
linker region and PB1 but not PB2 was also unable to bind phosphopeptides. 
Thus, it appears that the linker region together with both Polo-boxes functions 
together as a single phosphopeptide-binding module, and we therefore propose 

10 that this segment be called the Polo-box Domain (PBD). Intriguingly, this region 
encompassing both Polo-boxes has been previously shown to regulate the 
localization of Plk-1 to centrosomes and kinetochores during prophase and to the 
midbody during late stages of mitosis. Significantly, neither Polo-box alone was 
sufficient for this localization function, though mutations within PB1 were 

1 5 sufficient to disrupt it. 

The Plk-1 Polo-box Bonnaimi Consensus Motif 

A central feature of our screen for phosphopeptide-binding domains is that 
any pSer/Thr-binding domain identified through interaction with phosphopeptide 

20 library-immobilized beads is amenable to subsequent determination of its optimal 
binding motif using a standard "forward" peptide library screening approach. A 
GST fusion protein of the Plk-1 PBD was therefore expressed in bacteria, 
immobilized on glutathione beads, and incubated with degenerate phosphopeptide 
libraries oriented on a fixed pThr-Pro (Figure 3 A) or pSer-Pro motif (Figure 3B). 

25 Following extensive washing, the PBD-bound peptides were eluted and 

sequenced, and the amount of each amino acid in every degenerate position was 
compared to that present in the starting library mixture to derive amino acid 
selectivity ratios. Surprisingly, the Plk-1 PBD displayed an extraordinarily strong 
and novel selection for Ser in the pThr-1 position when the pThr-Pro library was 
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used. Extremely strong selection for Ser was also observed in the -1 position 
when the PBD was assayed using the fixed pSer-Pro library. Binding of the PBD 
to a phosphoserine-containing peptide library is noteworthy in itself, since at least 
one other family of phosphopeptide-binding modules, FHA domains, appear to 
5 bind only to phosphothreonine-containing motifs. The relative selection values 
observed for Ser in either the pThr-1 or pSer-1 position, 5.9 and 8.1 respectively, 
are among the largest we have observed for any domain whose specificity has 
been previously determined by peptide library screening. 

Since the Plk-1 PBD was isolated in a screen for domains that bind to pThr- 

10 Pro motifs, it was important to determine the relative importance of Pro in the 

pThr+1 position for PBD recognition. To accomplish this, peptide library screens 
were performed with libraries containing a fixed pThr residue, a fixed pSer 
residue, fixed Ser-pThr residues, or fixed Ser-pSer residues (Table 1, Figures 3C, 
and 3D). Little selection was observed for proline in the pThr/pSer+1 position 

15 when serine was not fixed in the pThr/pSer-1 position (Table 1). Inclusion of 
serine at this position in a Ser-pThr oriented library, however, unmasked a 
moderate selection (1.7) for proline at pThr+1 (Figure 3C and Table 1). Proline 
selection (1.8) was also uncovered at this position when a Ser-pSer oriented 
library was used (Figure 3D and Table 1). Notably, synergistic selection between 

20 serine and proline was also observed in reverse such that inclusion of a fixed Pro 
residue in the peptide libraries led to a higher selection for serine (Table 1). 

Table 1 , below, summarizes the results obtained from phosphopeptide 
motif selection screening. 
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Table 1 . pT and pS Peptide Motif Selection by Plk-1 Polo Box Domain 
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A GST fusion of the Plk-1 Polo Box Domain was screened for binding to 
six phosphopeptide libraries, which contained the sequences 
5 MAXXXXpTPXXXXAKKK SEQ ID NO:3 1 , MAXXXXpTXXXXAKKK SEQ 
ID NO:32, MAXXXXSpTXXXXAKKK SEQ ID NO:33, 
MAXXXpSPXXXAKKK SEQ ID NO:34, MAXXXXpSXXXXAKKK SEQ ID 
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NO:35, and MAXXXXSpTXXXXAKKK SEQ ID NO:36, where X indicates all 
amino acids except Cys. Residues showing strong enrichment are underlined. 
Selection for Pro (1 .4) was observed in the -4 position in the X4SPTX4 and 
X4SPSX4 screens. Slight selection for aliphatic and aromatic residues was 
5 observed in the +2 position in most screens. Little or no selection was observed in 
the -5, +3 5 +4, or +5 positions in any of the screens. 

These results suggested that the presence of Pro in the pThr/pSer+1 
position, while helpful, was not absolutely required for binding. In agreement 
with this, the Plk-1 PBD bound in a phospho-specific manner to bead-immobilized 

10 peptide libraries containing either a fixed pThr-Pro dipeptide or an isolated pThr 
alone (Figure 3E). In contrast, the other protein isolated in our screen, full-length 
Pinl, bound only to the pThr-Pro peptide library beads. 

To verify the results of oriented peptide library screening, binding of 
individual phosphopeptides to the Plk-1 PBD was measured by isothermal titration 

15 calorimetry (Figure 4A and 4B). The optimal phosphopeptide ligand 

(PoloBoxtide-optimal), containing the core sequence Met-Gln-Ser-phoshoThr-Pro- 
Leu derived from peptide library screening, bound tightly to the Plk-1 PBD with a 
dissociation constant of 280 nM. Furthermore, it formed a 1:1 protein/peptide 
complex, indicating that separate phosphopeptides were not interacting 

20 simultaneously with each of the two polo boxes within the PBD. Substitution of 
threonine for phosphothreonine (PoloBoxtide 8T) resulted in complete loss of 
binding, reiterating the absolute dependence of interaction on the presence of a 
phosphate group. Substitution of phosphoserine for phosphothreonine within the 
optimal PBD motif maintained peptide binding to the Plk-1 PBD in agreement 

25 with the peptide library screening results, albeit with a seven-fold drop in affinity. 
In contrast, substitution of phosphotyrosine for phosphothreonine completely 
abrogated binding, demonstrating conclusively that the Plk-1 PBD is a pThr/pSer- 
specific binding domain. The extraordinarily strong selection observed for Ser in 
the pThr/pSer-1 position within the Plk-1 PBD binding motif was confirmed using 
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a series of mutant peptides. When this Ser was replaced with either of the 
sterically small amino acids Ala or Gly, with the hydroxyl containing amino acid 
Thr, or with the homologous amino acid Cys, no peptide binding was detectable. 
Moderate selection for Pro in the pThr/pSer+1 position was verified by a greater 
5 than five-fold increase in Kd when another P-turn forming residue, Asn 3 was 

substituted for Pro in this position. Based on the oriented peptide library screening 
data (Figure 3, Table 1) and these ITC results, we therefore propose that the core 
consensus motif recognized by the Plk-1 PBD is S-[pT/pS]-(P/X). 

1 0 Physiological Substrates of PBD 

The monoclonal antibody MPM-2 (Mitotic Phosphoprotein Monoclonal-2), 
originally raised against mitotic HeLa cell extracts, recognizes a conserved 
pSer/pThr-Pro epitope present on ~ 50 phosphoproteins that are localized to 
various mitotic structures. The initial screen from which the Plk-1 PBD was 

15 identified used a peptide library that was partially biased to resemble the MPM-2 
epitope. A number of important mitotic regulators that are recognized by this 
antibody, including Cdc25, Weel, Mytl, Topoisomerase II alpha and inner 
centromere proteins (INCENP), contain one or more exact matches of the S- 
[pS/pT]-P PBD-binding motif. We therefore investigated whether the Plk-1 PBD 

20 bound to MPM-2 reactive proteins. HeLa cells were treated with aphidocolin to 
induce a Gl/S arrest or with nocodazole to induce a G2/M arrest and cell lysates 
were analyzed by immunoblotting (Figure 5A). As expected, the number of 
MPM-2 reactive proteins was greatly enhanced in the mitotically-arrested cells. 
Many of these MPM-2 reactive mitotic phosphoproteins were specifically bound 

25 by the Plk-1 PBD, suggesting that phosphorylation of these proteins by proline- 
directed mitotic kinases generated a PBD-binding site. Furthermore, the Plk-1 
PBD bound to a different and somewhat smaller subset of MPM-2 epitope- 
containing proteins than those that bound to Pinl (Figure 5 A), which was expected 
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given that the MPM-2 epitope motif more closely resembles the optimal consensus 
motif for Pinl than that of the Plk-1 PBD. 

To determine whether the Plk-1 PBD associates with MPM-2 epitopes 
through its phosphopeptide binding pocket, peptide competition assays were 
5 performed. Pre-incubation of the Plk-1 PBD with its optimal phosphopeptide 
ligand dramatically inhibited the binding of MPM-2 epitopes (Figure 5B, 'opt'). 
In contrast, the non-phosphorylated analogue ('8T') or a peptide with Val 
substituted for Ser in the pT-1 position ('7V') had no effect. 

One particular MPM-2 antigen that is also known to be phosphorylated and 

10 regulated by Plk-1 and its Xenopus homologue is the cell-cycle regulated protein 
phosphatase Cdc25. We therefore investigated whether Cdc25C associated with 
the Plk-1 PBD in a cell-cycle-regulated and phospho-specific manner. During 
mitosis, Cdc25C undergoes a dramatic reduction in gel mobility due to extensive 
phosphorylation at its N-terminus. The Plk-1 PBD was found to interact only with 

15 this mitotically up-shifted form of Cdc25C (Figure 6A). Pre-incubation of the 
Plk-1 PBD with its optimal phosphopeptide ligand, but not with the 8T or 7V 
mutant peptides, completely prevented this association, demonstrating that it was 
mediated through the phosphopeptide binding pocket of Plk-1. During mitosis, 
Cdc25C is known to be phosphorylated on five conserved Ser/Thr-Pro sites within 

20 its N-terminus. One of these sites, Thr^o (corresponding to Thr 138 in Xenopus 
Cdc25C) contains a conserved Plk-1 PBD consensus motif (Figure 6B). To 
investigate whether this site was important for the Cdc25C-Plk-l interaction, 
HeLa cells were transfected with HA-tagged wild-type Cdc25C, or with Thr 130 Ala 
or Ser 12 9Val point mutants of Cdc25C expected to disrupt the PBD-binding motif. 

25 Following mitotic arrest with nocodazole, the Plk-1 PBD bound strongly only to 
the wild-type protein, but only very weakly to either of the point mutants, 
indicating direct interaction between the Plk-1 PBD phosphopeptide -binding 
pocket and a mitotically-phosphorylated PBD consensus motif in Cdc25C (Figure 
6C). Furthermore, both of these point mutants had a decreased electrophoresis 
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mobility shift when analyzed on lower percentage gels (Figure 6D), suggesting 
that mutations which impair Plk-1 PBD binding result in incomplete Cdc25C 
phosphorylation in vivo. 



5 Centrosomal localization of the Plk-1 PBD occurs through its 
phosphopeptide-binding pocket. 

Plk-1 localizes to centrosomes and kinetochores in prophase and to the 
spindle mudstone during late stages of mitosis. Centrosomal localization has been 
shown to require both the PB1 and PB2 regions, but not kinase activity, since 

10 localization is maintained when Lys 82 , which is mediates phosphate transfer, is 
mutated to Met. To investigate whether the phosphopeptide binding function of 
the Plk-1 PBD was critical for its centrosomal localization, U20S cells were 
mitotically arrested with nocodazole, permeablized with Streptolysin-O, and 
incubated with GST-Plk-1 PBD in the absence or presence of peptide competitors. 

1 5 The Plk-1 PBD was observed to localize to the centrosomes of late prophase- 
arrested cells (Figure 7A), as verified by co-staining with an anti-y-tubulin 
antibody. 

This centrosomal localization was significantly disrupted in the presence of 
an optimal Plk-1 PBD phosphopeptide but was unaffected when the assay was 

20 performed using the same concentration of the non-phosphorylated peptide 

analogue (Figures 7A and 7B). This observation, together with published data 
showing that the C-terminus of Polo-like kinases is essential for their function in 
vivo, strongly suggests that intracellular targeting of Plk-1 to critical substrates is 
mediated through interaction of the PBD phosphopeptide pocket with 

25 phosphorylated motifs in mitotic structures. 
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The Plk-1 PBD and regulation of mitotic progression by cyclin-dependent 
kinase priming 

Our identification of the Plk-1 PBD as a novel phosphoserine/threonine- 
binding domain adds another member to the growing superfamily of pSer/Thr- 
5 binding modules and demonstrates the general utility of our phospho-motif-based 
affinity screen for discovering and functionally characterizing novel signaling 
domains that function downstream of protein kinases. This screening technique 
can be used to identify binding modules interacting with substrates of any kinase 
whose phosphorylation motif is known. Other techniques that identify protein- 

10 protein and protein-peptide interactions, such as yeast 2 -hybrid and phage display 
approaches cannot be used in screens for phospho-binding domains since reliable 
and constitutive phosphorylation of a diverse collection of bait sequences is 
required. A further strength of our technique is that any domain isolated through 
screening with bead-immobilized peptide libraries yields an optimal consensus 

15 binding motif when the domain is subsequently analyzed by traditional peptide 
library screening. This allows the motif for the pSer/Thr-binding domain to be 
combined with that of the potential phosphorylating kinase(s) in database 
searching and protein sequence analysis and should facilitate the proteome-wide 
prediction of ligands within a common signaling pathway. 

20 The C-terminal region of Polo-like kinases has long been recognized as 

essential for their in vivo function in mitosis and cytokinesis, but its structural 
mechanism has remained mysterious. Mutations within this region of Plk-1 and 
its S. cereviseae homologue, Cdc5, abolish their ability to rescue a temperature- 
sensitive mutant of cdc5 despite the presence of a fully functional kinase domain. 

25 When expressed alone, the C-terminal domain of Polo-like kinases localizes to 
centrosomes and the spindle midzone similar to the full-length kinase, and its 
overexpression causes mitotic and cytokinetic arrest. 

We have shown that the C-terminal domain of Plk-1 is a 
phosphoserine/threonine-binding module whose phospho-binding pocket binds to 
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known Polo substrates and mediates localization to subcellular sites where 
endogenous Polo kinases are found. In the basal state the PBD binds to the kinase 
domain, inhibiting its phosphotransferase activity. In addition to overcoming this 
inhibition, maximal activation of the kinase domain also requires phosphorylation 
5 in its activation loop by upstream kinases such as xPlkkl/SLK. This requirement 
for both priming phosphorylation of substrates and activation loop 
phosphorylation provides a molecular switch that regulates Plk-1 kinase function 
at discrete stages of the cell cycle. In addition, it provides a potential means for 
mitotic checkpoint control, since neither phosphorylation of the activation loop 

10 nor substrate priming phosphorylation alone would be sufficient for proper 
activation of Polo kinases in vivo. 

A number of striking parallels between the PBD of Plk-1, SH2 domains in 
Src family kinases, and FHA domains in the Rad53/Chk2 family of checkpoint 
kinases are apparent. Like the Plk-1 PBD, SH2 domains of Src-family kinases 

15 both inhibit kinase activity in the inactive state and facilitate substrate targeting 
when Src kinases have been activated by phosphorylation on their activation 
loops. In Src kinases, the mechanism of inhibition involves intramolecular 
binding of the SH2 domain to a pTyr motif at the end of the kinase domain. It 
remains unknown whether Polo kinase family inhibition by the PBD involves a 

20 similar interaction with internal pSer/pThr sites, or whether an alternative PBD 
surface is involved. Members of the Chk2 kinase family contain one or more 
pThr-binding FHA domains in addition to the kinase module. The FHA domain(s) 
are critical for proper Chk2 function in response to DNA damage and for the 
phospho-dependent targeting of Chk2 into larger multimolecular complexes where 

25 activation occurs. 

We found the optimal motif for Plk-1 PBD binding to be S-[pS/pT]-P/X. 
Differences in PBD selectivity for amino acids flanking the pSer/Thr position are 
likely to be biologically important for the interaction of Polo kinases with their 
substrates in vivo. The primary role of the +1 Pro may be to link phospho- 
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dependent PBD binding to activation of cyclin-dependent kinases that 
phosphorylate the motif, providing a means to temporally and spatially regulate 
the action of Polo-like kinases during mitosis. The absolute requirement for Ser in 
the -1 position provides strong discrimination for Plk-1 binding to only a limited 
5 subset of mitotic kinase substrates. In addition, we found that the motif 

recognized by the Plk-1 PBD partially overlaps with the proline-directed sequence 
motif recognized by the monoclonal antibody MPM-2 which reacts against a large 
number of mitotically phosphorylated proteins, and we demonstrated a direct 
interaction between the PBD phosphobinding pocket and MPM-2 reactive proteins 

10 in pull-down experiments with mitotic cell extracts. This finding provides an 
elegant explanation for the progressive accumulation of MPM-2 immuno- 
reactivity and Polo kinase localization observed at maturing centrosomes, and 
suggests that generation of MPM-2 epitopes by Cdks and other mitotic kinases 
triggers PBD-mediated recruitment of Polo kinases to specific mitotic structures. 

1 5 Both Cdks and Polo kinases have been implicated in activating the 

phosphatase Cdc25, leading to desphosphorylation and activation of Cdc2/Cyclin 
B and progression through mitosis. The relative roles of Cdks and Polo kinases in 
Cdc25 activation, however, remains controversial. Our finding that the Plk-1 PBD 
binds to one or more critical Cdk sites on Cdc25C suggests a molecular rationale 

20 for 2-step activation of Cdc25 that has been postulated to drive auto-amplification 
of Cdc2/CyclinB activity. In prophase, low levels of Cdc2/CyclinB activity are 
insufficient to fully activate Cdc25, but provide priming phosphorylation of Cdc25 
for interaction with the PBD. Subsequent activation of Polo kinases later in 
mitosis by activation loop kinases such as Plkkl/SLK leads to an initial wave of 

25 Cdc25 activation, which generates more Cdc2/Cyclin B activity, primes additional 
Cdc25 molecules for activation by Polo-like kinases, and results in a positive 
feedback loop for the production of additional Cdc2/Cyclin B activity (Figure 8). 
This model is able to explain the result of Toyoshima-Morimoto et al (EMBO 
Rep,, 3:341-348, 2002) that maximal intracellular targeting and activation of 
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Cdc25, even in the presence of constitutively active Plk-1, still requires the co- 
expression of Cyclin Bl. 

Increased levels of Plk expression have been detected in a variety of human 
tumors and tumor cell lines, and high levels of expression correlate with poor 
prognosis. The PBD would be an attractive target for the design of anti- 
proliferative chemotherapeutics since its compact tripeptide binding motif may be 
particularly amenable to the design of small molecule peptidomimetics. 

Optimal phosphopeptide-binding motifs for the PBDs from all members of 
the human Plk family, Xenopus Plxl and Saccharomyces cerevesiae Cdc5p were 
determined by oriented peptide library screening as described above. Since we 
initially isolated the Plkl PBD in a search for domains that recognize a pThr-Pro- 
containing motif, primary screens were performed using peptide libraries 
containing a fixed pThr-Pro core flanked on both sides by four degenerate 
positions. As seen in Tables 2 and 3, the five PBD's examined each selected for 
distinct but largely overlapping motifs. 
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Table 2 Phosphothreonine Peptide Motif Selection by Human Polo Kinase 

Family PBDs 

•5 -4 -3 -2 -1 +1 +2 

Plk1 





M(1.5) 
F(1.1) 


M (1.3) 
Y(1.3) 
H(1.3) 
F(1.2) 
K(1.2) 


A (1.4) 
H (1.4) 
M(1.4) 
T(1-3) 
F(1.3) 


S (5.91 
A (1.6) 


PT 


P 


F(1.2) 
1(1.2) 
K(1.2) 


P<1.4) 
F(1.1) 


P(1-S) 
F(1.3) 
M(1.3) 
L(1.2) 
I (I D 


M(1.5) 
F(14) 
L(1.2) 


0(1.5) 
A (1.5) 
H (1.5) 
M(1.4) 
F(1.3) 
T(1.2) 


s 


PT 


P(16) 
M (1.3) 


L(1-2) 
K (1.1) 
V(1.1) 



Plk2 





F(1.9) 


0(1.9) 


T(2.1) 




1(1.6) 


M(1.8) 


H ( 2.1) 




M(1.5) 


H(1-6) 


0(1-2) 




L(1.4) 


F(1.3) 






P(1.1) 






Pl?4l 


M{1.5) 


0(1-9) 


JIPM 


F(1.4) 


F(1.5) 


T(1.6) 


H (2.0) 


1(1.2) 


P(1.4) 


M(1.6) 


0(17) 




L(1.4) 


H(1-6) 






I (13) 


F(1.2) 






V(1-2) 







Sf7S1 pT P F(1.5) 

L(1.5) 
1(1-3) 
V(1.1) 

S pT P(1.7) K(1.5) 

L(1.2) 
1(1.1) 



Plk3 



P(1.2) 



I (15) 


M{1.6) 


T(1.6) 


S (3.0) 


L(1.4) 


L(1.3) 


H (1-4) 




V(1.3) 


F(1.3) 






F(1.2) 








P(1.2) 








L(1.2) 


A (1.5) 


T(2.6) 


S 


1(1.2) 


M(1.2) 


H (1.6) 






F(1.2) 








I (12) 







PT 



PT 



P (1.6) 
0(1.4) 
E (13) 



K(1.3) 
V(1.2) 
F(1.2) 



K(1.4) 



GST fusions of the Polo-box Domains (PBDs) from hPlki, hPlk2, and hPlk3 wera screened for binding to 
phosphopeptide libraries containing die sequences MAXXXXpTPXXXXAKKK and MAXXXXSpTXXXXAKKK, where X 
indicates all amino acids except Cys. Residues showing strong enrichment are underlined. 
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Table 3 Phosphothreonine Peptide Motif Selection 
by Polo Kinase PBD Orthologs 



•5 


-4 


-3 


-2 


-1 




+1 


+2 


Pbcl 




P<21) 
1(1.6) 
L(t.3) 
M(1.2) 


F(1.6) 
L(1.5) 
11(1.5) 


T(2.i) 
H(1.7) 


5(7 31 


PT 


P 


1(1.6) 

M1-5) 
V(1.1) 


P(t-B) 
F(1.4) 


P(1.6) 
F<1.5) 
L(1.5) 
1(1.4) 
M{1.3) 


M(1.5) 
M1.4) 


Tt3D) 

H (1.8) 
0(1.3) 


s 


PT 


P<1j9) 


K(1.4) 
1(1.3) 
L<12) 


CdcS 




1/1(1.9) 
L(1.5) 
1(1.4) 
F<1.2) 


A(2.5) 
M(1.5) 
I s (1.1) 


T(2.4) 
A(1.8) 
0(1-5) 

M (1-4) 


S (5.3) 


PT 


P 


X 


P(1.3) 


L(2J>) 

M(1.7) 

1(1.5) 

P<1-S) 

V(1-1) 


A 41 
V<1-3) 
1(1.2) 


A (2.1) 
<> (1.7) 
HI*) 
H(US) 
M(1-3) 


s 


PT 


P(W) 


L(1JJ) 
K1.1) . 



GST fuator* a! thn Pcto-ba* Domains <PBOb> frtrn ^onqpas Pttt And S. GerwteiatfCdGSp were atmettod lor binding to 
pf^hcpep&fc Ubrartfw containing the saquancet* MAXXXXpTPXXXXAKKK and MAXXXXSpTXXXXAKKK, where X 
indicates £dl&niirtf> acids excfifiCyn. Residues showing *&o09 fennctatiefll are lind&ffrtcd. 



All of the PBDs showed unequivocal selection for Ser in the pThr-1 
position with selectivity ratios (i.e. the mol% of Ser in the PBD-bound peptides at 
the pThr-1 position divided by the mol% of Ser in the starting library mixture at 
the pThr-1 position) ranging from 3.0 to 7.5. Motif similarity occurs even though 
these PBDs vary considerably in amino-acid sequence and the respective human 
Plks perform divergent cellular functions. The PBDs as a group consistently 
demonstrated moderate selection for Thr, His, Gin, and Met in the pThr-2 position. 
There was general selection amongst all PBDs for aliphatic and aromatic residues 
in the pThr-3, pThr-4 and pThr+2 positions, although Cdc5p showed a particularly 
strong and unique selection for Ala in the pThr-3 position, while Plk2 showed 
strong and unique selection for Gin at this position. All PBDs except Cdc5p also 
selected for Pro in the pThr-4 position and Lys in the pThr+2 position 
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Based on these data, secondary peptide libraries containing a fixed Ser- 
pThr core were used to further refine the motifs and investigate the relative 
importance of Pro in the pThr+1 position. These screens revealed modest 
selection for Pro at pThr+1 for all PBDs, with selectivity ratios ranging from 1.4 to 
5 1 .9 (Tables 2 and 3). Selection at other motif positions for each PBD was 

consistent with those obtained using the pThr-Pro library, though we were now 
able to observe significant and conserved selection for Pro and Phe in the pThr-5 
position. (pT-5 was degenerate in the Ser-pThr library, but was a fixed Ala 
residue in the pThr-Pro-oriented library.) Thus, it appears that the PBDs of all Plks 
10 investigated, including all conventional human Plk homologues, select a similar 
motif that can be most generally represented by the consensus sequence: 
[Pro/Phe]-[<|>/^ 

SEQ ID NO:38, where (j) represents hydrophobic amino acids. 

The striking selection observed for Ser in the pThr-1 position in all PBDs 
15 was examined in detail for the human Plkl PBD, which binds to its optimal motif, 
Pro-Met-Gln-Ser-pThr-Pro-Leu (SEQ ID NO:39) (Table 2), with a Ka of 280 nM 
(Figure 9A). 

A variety of small side-chain amino-acids were therefore substituted in the 
pThr-1 position, and peptide binding to the Plkl PBD measured using isothermal 

20 titration calorimetry (ITC) (Figure 9A). Surprisingly, replacement of Ser with 
Gly, Ala, the hydroxyl-containing amino-acid Thr, or the Ser isostere Cys, 
completely abrogated Plkl PBD-phosphopeptide binding. We had previously 
observed that replacement of Ser at the pThr-1 position with Val, the amino-acid 
showing the lowest selection in this position, was sufficient to eliminate peptide 

25 binding (Elia et al., Science 299:1228-1231, 2003). Nevertheless, the finding that 
replacement of Ser with a variety of chemically similar amino acids also 
completely disrupted the interaction between the PBD and free phosphopeptides in 
solution was unexpected. 
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To extend this analysis, each amino acid in the eight positions flanking the 
phosphothreonine within the optimal Plkl PBD binding motif was substituted with 
each of the remaining nineteen naturally occurring amino acids using a solid phase 
array of immobilized phosphopeptides (Figure 9B). This conclusively 
5 demonstrated that only Ser was tolerated in the pThr-1 position (Figure 9B). 
Selectivities at other positions were generally consistent with the results of 
oriented peptide library screening. Cys and Gly, however, were selected at the 
pThrfl position at least as strongly as Pro in the immobilized phosphopeptide 
assay. Cys is routinely omitted during construction of oriented peptide libraries to 

10 minimize cross-linking and oxidation effects. Higher relative selection for Gly in 
the context of immobilized peptides than in solution phase peptide library assays 
may be due, in part, to the greater entropic penalties associated with ordering Gly 
residues compared with Pro residues when both ends of a peptide are free. 
Alternatively, these subtle differences may reflect the fact that the peptide filter 

15 assay examines individual point mutations in the context of a single amino-acid 
sequence, while oriented peptide library screening samples an entire ensemble of 
sequence motifs simultaneously. Regardless, Pro probably represents the most 
'physiological' amino acid in the pThr+1 position, since the phosphorylation event 
necessary for PBD binding is likely to be catalyzed primarily by Pro-directed 

20 kinases such as Cdks and MAP kinases. 

Overall Structure of the Plkl PBD 

The boundaries of the minimal PBD within the C-terminal regions of both 
Plkl and Cdc5p were determined using limited proteolysis and mass-spectrometry. 
25 Studies using V8 protease (Figure 10A) and trypsin (data not shown) indicated 
that only the last 45 residues of the linker between the kinase domain and the first 
Polo-box were structured as part of the PBD (Figure 10A). Similar results were 
obtained using the C-terminal segment of Cdc5p (data not shown). We refer to the 
beginning of this additional region as the Polo-cap (Pc). For both Plkl and Cdc5p, 
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we found no significant difference in the phosphopeptide-binding affinities of 
fragments encompassing the entire C-terminal regions or the proteolytically- 
defined PBDs, indicating that the first ~ 40 amino acids between the kinase and 
the Pc plays no major role in peptide binding. Shorter fragments of both Plkl and 
5 Cdc5p encompassing just the Polo boxes, but lacking the Pc, were insoluble in E. 
coli, indicating a clear structural role for the Pc in both proteins, despite the 
absence of any extensive sequence homology between the two proteins in this 
region. 

The X-ray structure of a recombinant form of the proteolytically-defined 
10 Plkl PBD (residues 367-603) in complex with its 'optimal' phosphopeptide was 
solved by multiwavelength anomalous diffraction (MAD) using Se-Met- 
containing protein, and refined against native data extending to 1 .9A resolution 
(Table 4). 

Table 4 Crystallographic analysis 

Data Collection 

I*C«s!<>A) Native (0.98) Se<0.97838) Sc (0.97887) Se(0.95) 

14.1 • SRS 14.2.SRS 

d(A) 20.0-1.9 20.0-3 J 20.0-3.5 20.0*3.5 

Completeness (**•) 97.7 99.9 99 JO 99J2 

Redundancy' 3j6 3.7* -1.9* 

fl^f^) 5 5 J SA y 53? 4.9* 

Phasing analysis 

R**e4bin(A> 20-11.2 11.2.7.5 7.5-*.0 6.0-S.2 5.2-4.6 4.6-4.2 4.2.3.9 3JJL3.6 

POM 0.79 0.83 0.79 0.70 034) 0.53 0.4* 0.44 

Mem FOM . 0.60 

Refinement 
™WA) 

24.0 26.8 0.007 1.2 



: Rqm • Sj^cl> - I^S<I> nfcerc I, is (ho intensity of Chejfii reflection and <)> fa fce average intensity. 
1 Calcubied wtih Bijvaeb separated 

Rt», *uf« hid ctlcubttd on 5^ I of the data exefaded (ran fee refinement calculation. 
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The structure (Figure 10B) shows that the PBD contains two p 6 a motifs 
that comprise the two Polo-box regions (PB1 & 2) identified by sequence 
profiling. The atomic structural coordinates of this structure are provided in Table 
5. In spite of the fact that the amino-acid sequences of the two Polo-boxes within 
5 any one Plk exhibit only -20-25% sequence identity, the structures of the two 
motifs are quite similar (root mean square (rms) deviation of 77 Cot atoms of 
1.6A; Figure 10B). The two Polo-boxes pack together to form a 12-stranded p- 
sandwich flanked by three a-helical segments (Figure 10C). Although motifs 
resembling the Polo-box structure are represented in the Protein Databank, the 

10 overall domain structure represents a new protein fold. 

The Pc consists of an a-helical segment oiA, loop, and short 3 10 helix which 
connects to the N-terminal (3-strand of Polo-box 1 (pi) through a -10 residue 
linker region (LI). The Pc wraps around Polo-box 2 like a hook tethering it to 
Polo-box 1 . aA packs against aC from PB2 in an anti-parallel coiled-coil 

15 arrangement, while the 3 10 helix packs against the shorter aC\ The two Polo- 
boxes are connected by a second -30 residue linker sequence (L2) that is partially 
conserved. LI and L2 run in anti-parallel directions between the two Polo-box P- 
sheets. Thus, the hydrophobic core is formed from direct interactions of highly 
conserved non-polar residues predominantly located on pl/p2 from PB1 and p6/p7 

20 from PB2, together with an array of interactions with the intercalating linker 
regions. 

Novel PBB-Phosphopeptide Interactions are Crucial for Specificity 

The phosphopeptide binds in a largely extended conformation to a region of 
25 positive charge, located at one end of a shallow cleft formed between the two 
Polo-boxes (Figure 10). In all, -1000 A 2 of solvent accessible surface are buried 
by binding of the seven phosphopeptide residues that are visible in our electron 
density maps. Binding involves part of an extensive, highly conserved surface that 



-71- 



is located exclusively on the peptide-binding face of the PBD (Figure 1 1 A, 1 IB). 
This conserved surface coincides with the only significant region of positive 
electrostatic potential within the entire PBD (Figure 1 1C). Overall, the 
phosphopeptide interacts predominantly with pi from PB1, the N-terminal end of 
5 L2 and 08 and 9 from PB2. Hydrogen bonding interactions formed with the 
peptide side- and main-chain atoms alternate to some degree between residues 
within the two Polo-boxes, forming a zipper-like structure at the edge of the 
PB1/PB2 interface (Figure 11D). 

PBD binding to the phosphate moiety involves a combination of direct 

10 contacts with protein side-chains together with extensive indirect interactions 
through a well-defined lattice of water molecules, many of which are fully 
hydrogen-bonded (Figure 1 IE). In total, the phosphate group participates in eight 
hydrogen-bonding interactions explaining the critical dependence on peptide 
phosphorylation for binding (Elia et al., Science 299:1228-1231, 2003). The only 

15 residues that contact the phosphate group directly are His-538 and Lys-540 from 
PB2, whose side chains form a pincer-like arrangement that chelates the 01, 03, 
and Oy phosphate oxygens. 

The structural basis for the extraordinarily high selectivity for serine at the 
pThr-1 position results from a major difference in orientation of the bound 

20 phosphopeptide when compared with phosphopeptide complexes of 14-3-3 

proteins and FHA domains, the two major classes of pSer/pThr binding proteins 
(Durocheretal.,Mo/. Cell 6:1169-82,2000; Yaffeeta/., Cell 91:961-971, 1997). 
In these structures, the pThr-1 side-chain is solvent exposed and little selection is 
observed at this position. In contrast, the peptide orientation in the Plkl complex 

25 is inverted such that the Ser -1 side-chain is directed towards the Plkl surface 

(Figure 1 IB). In this orientation, it engages in two hydrogen bonding interactions 
with Trp-414 main-chain atoms, and one with the Leu-491 main-chain carbonyl 
via a water molecule (Figure 1 1C). Significantly, the Ser -1 Cp atom makes 
favourable van der Waals interactions with C8l from the Trp-414 indole side- 

-72- 



chain. This explains why even a conservative replacement of Ser with Thr at this 
position abrogates peptide binding (Figure 9 A), presumably due to a steric clash of 
the threonine y-methyl substituent with Trp-414. 

The critical role of Trp-414 in ligand binding revealed by our crystal 
5 structure (Figure 1 ID) explains the observation that a W414F mutation eliminates 
both centrosomal localization of Plkl and its ability to complement the cdc5-l ts 
mutation (Lee et al., Proa Natl Acad. ScL USA 95:9301-9306, 1998). Both of 
these effects are likely to be at least partly attributable to disruption of critical Ser- 
1 interactions with the PBD. In agreement with this, a mutant PBD containing the 

10 W414F substitution is severely compromised in phosphopeptide binding, with an 
affinity of > 100 [iM as determined by ITC. Loss of binding is unlikely to result 
from gross structural perturbation of the Polo-box fold, since the mutant PBD 
exhibits similar secondary structural content to the wild-type protein as judged 
from far UV CD spectra (data not shown). Furthermore, Trp-414 in Polo-box 1 is 

15 replaced by tyrosine in PB2 of both wild-type S. pombe Plol and S. cerevisiae 
Cdc5p PBD's, (Figure 1 1 A), showing that similar substitutions are naturally 
tolerated in a related structural context. 

Consistent with the oriented library selection, the protein-peptide interface 
is dominated by interactions of the PBD with the pThr and Ser-1 (Figure 1 1C, 

20 1 ID). Although we observed modest selection for Pro at the pThr+1 position, it 
appears from the structure that it does not contribute greatly to the binding 
interface, and multiple substitutions at this position are tolerated for peptide 
binding (Figure 9B). In the PBD structure, the /raw-proline introduces a kink 
after the Ser-pThr directing the peptide backbone back toward the binding surface, 

25 allowing the pThr+2 main chain amino group to contact the PBD. Thus, the +1 
Pro likely increases binding affinity by diminishing the entropic penalty for 
making this favorable backbone contact. This contrasts with structures of pSer- 
Pro peptide complexes of both the Pinl WW and the Cdc4 WD40 domains in 
which the Pro+1 side chain inserts into a hydrophobic pocket and makes coplanar 
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interactions with a buried tryptophan (Leung et al., Nat. Struct BioL 9:719-724, 
2002; Verdecia et al., Nat Struct Biol 7:639-643, 2000). 

Plkl and Sak Polo-boxes are Structurally Distinct - One Motif, Two Folds 
5 The human Plk family encompasses the canonical kinases (Plks 1-3) and 

Sak, which contains a highly homologous Ser/Thr kinase domain but only a single 
divergent Polo-box. Recent structural data has shown that the isolated Polo-box 
from murine Sak forms an intermolecular dimer, leading to the suggestion that 
tandem Polo-boxes in Plkl -related Plks may form a related, intra-molecular 

10 'dimeric' architecture (Leung et al, Nat. Struct. Biol 9:719-724, 2002). Our 
structure shows that this notion is broadly correct. In each case, the Polo-box 
repeat comprises a six-stranded p-sheet and a-helix. This structural unit 
associates with a second Polo-repeat via intra- or intermolecular interactions in 
Plkl and Sak respectively, to form (3-sandwich domain structures. However, 

15 closer examination reveals profound differences between the organizations of the 
two structures (Figure 12A and 12B). The p 6 ot topology of the Plkl Polo-box is 
replaced by a circularly-permuted psOtp topology in Sak. Consequently, Plkl pi 
has no equivalent in the Sak Polo-box sequence, and instead overlaps structurally 
with Sak P6. In addition, the Sak P-sheet is completed by a 'segment-swap' of p4 

20 & 5 between monomers. Most strikingly, the association of the two Polo-boxes 
differs completely such that residues forming the interface between Polo-repeats in 
the Sak homodimer are located largely on the exterior of the Plkl p-sandwich, 
where they partially form the interface with the flanking a-helical segments. 

25 Mutation of the His-Lys Pincer Abolishes Phosphopeptide Binding in vitro, 
Cdc25 Binding in vivo, and Centrosomal Localization of the Plkl PBD 

To verify that the key phosphothreonine-interacting residues identified in 
the X-ray crystal structure were indeed responsible for mediating phospho- 
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dependent interactions in vitro and in vivo, we mutated His-538 and Lys-540 of 
the pThr pincer motif, to either Ala and Met, or Glu and Met, respectively. These 
mutations severely disrupt phosphopeptide binding in solution as judged by the 
reduced binding of in vitro translated Plkl PBD to a bead-immobilized pThr-Pro 
5 oriented library (Figure 13 A) and by ITC (Figure 13B). 

During mitotic entry, Cdc2/Cyclin-B and Plkl cooperate to activate the 
dual specificity phosphatase Cdc25 through extensive phosphorylation of its N- 
terminus as part of an amplification loop for Cdc2/Cyclin-B activation (Abrieu et 
al.,J. Cell ScL 111:1751-1757, 1998; Hoffmann et al., EMBO J. 12:53-63, 1993; 

10 Izumi et aL, Mol Biol Cell 4:1337-1350, 1993; Izumi et al, Mol Biol Cell 6:215- 
226, 1995; Kumagai et al, Cell 70:139-151, 1992; Kumagai et al., Science 
273:1377-1380, 1996; Qian et al., Mol Cell Biol 19:8625-8632, 1999; Qian et 
al., Mol Biol Cell 12:1791-1799, 2001). Mitotically phosphorylated Cdc25C 
exhibits a large mobility shift on SDS-PAGE (Kumagai et al., Cell 70:139-151, 

15 1992). Cdc25C is phosphorylated on at least five Ser/Thr-Pro sites by 

Cdc2/Cyclin-B in vitro (Izumi et al., Mol Biol Cell 4: 1337-1350, 1993; Strausfeld 
et al, J. Biol Chem. 269:5989-6000, 1994). One of these sites, Thr-130, occurs 
within a near-optimal PBD binding motif, Leu-Leu-Cys-Ser-pThr-Pro-Asn. We 
previously observed that a GST-fusion of the isolated PBD could pull-down wild- 

20 type Cdc25C, but not a T130A or S129V Cdc25C mutant, from mitotically- 
arrested HeLa cell lysates. These data strongly suggested that Cdk priming of 
Thr-130 generates a binding site for the Plkl PBD to facilitate full activation of 
Cdc25C by subsequent Plkl -mediated phosphorylation (Elia et al., Science 
299:1228-1231, 2003). As shown in Figure 13C, expression of His-Xpress-tagged 

25 wild-type Plkl PBD in vivo results in a strong interaction with the mitotically 
phosphorylated form of endogenous Cdc25C in nocodazole-arrested HeLa cells. 
However, expression of the His-538/Lys-540 pincer mutants eliminates Cdc25C 
binding as also observed in cells transfected with a PBD construct lacking the 
second Polo-box. 
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To investigate whether the PBD plays a similar substrate-targeting role in 
the context of full-length Plkl, HeLa cells were transfected with myc-tagged wild- 
type or mutant constructs of full-length Plkl, and interactions between Plkl and 
endogenous Cdc25C examined in nocodazole-arrested cells using 
5 immunoprecipitation and Western blotting (Figure 13D). We observed a strong in 
vivo interaction between the mitotically upshifted form of endogenous Cdc25C 
with full-length Plkl in arrested cells that, somewhat surprisingly, was not 
increased when a kinase-dead Plkl mutant (K82R) or a double mutant 
incorporating a T210D mutation in the T-loop to further expose the kinase-binding 

10 cleft were employed as substrate traps. Conversely, mutation of the His-538/Lys- 
540 phosphate pincer mechanism in full-length Plkl completely disrupted the in 
vivo interaction between Plkl and Cdc25C demonstrating that the interaction of 
full-length Plkl with full-length Cdc25 in G2/M-arrested cells is mediated 
primarily through the PBD, rather than its associated the kinase domain. This 

1 5 result is important since it directly demonstrates a requirement for PBD 

phosphopeptide-binding in substrate targeting in the context of the full-length Plkl 
molecule. 

Finally, we observed that mutation of the His-538/Lys-540 pincer 
eliminates targeting of the Plkl PBD to centrosomes in permeabilized prophase- 

20 arrested cells (Figure 6). This finding suggests that the localization of Plkl to 
centrosomes observed in vivo (Jang et al, Proc. Natl Acad. Set USA 99:1984- 
1989, 2002; Lee et aL, Proc. Natl Acad. Sci. USA 95:901-9306, 1998) results 
from direct interactions between the PBD and phosphorylated centrosomal 
components. In summary, the results in Figures 13 and 14 show conclusively that 

25 the structurally defined His-538/Lys-540 pincer mechanism that is responsible for 
mediating phosphopeptide binding in vitro, plays a similar critical role in substrate 
targeting in vivo. 
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Phosphodependent Substrate Recognition is Necessary for the Disruption of 
Mitotic Progression by the Isolated Plkl PBB 

Since the PBD is necessary for targeting Plkl to primed substrates, its 

overexpression might be expected to act in a dominant-negative fashion to inhibit 

5 correct localization of endogenous Plkl and, therefore, disrupt Plkl function in 

vivo. Indeed, overexpression of the C-terminus of Plkl has been shown to cause 

mitotic arrest and induce formation of randomly oriented, disorganized spindles 

(Jang et al, Proc. Natl. Acad. Set USA 99:1984-1989; Seong et al., J. Biol Chem. 

277:32282-32293, 2002). The X-ray structure of the PBD-phosphopeptide 

10 complex now enables us to dissect the role of phospho-specific binding in this 

phenotype. In agreement with previous studies, we found that overexpression of a 
GFP-fusion of the Plkl PBD in HeLa cells caused a dramatic increase in the 
population of cells in G2/M (60% for PBD-GFP- vs. 17% for GFP-expressing 
cells) (Figure 15). Importantly, this accumulation of mitotic cells was abolished 

15 by mutation of His-538 and Lys-540 (23% in G2/M). In addition, expression of 
the wild-type PBD-GFP construct induced aneuploidy in HeLa cells, evident as a 
peak of cells with DNA content >4N, in agreement with anti-Plkl antibody 
microinjection studies reported by Lane and Nigg (Lane et al., J. Cell Biol 
135:1701-1713, 1996). However, this effect was completely lost when the 

20 His/Lys pincer mutant was employed. The dominant negative effects strongly 
suggest that phosphopeptide-binding by the PBD in full-length Plkl normally 
plays a role in both proper mitotic progression and in the establishment of a 
functional bipolar spindle to ensure equal chromosome segregation. 

25 Phosphopeptide Binding to the PBD Stimulates Plkl Kinase Activity 

Lee and Erikson (Lee et al, Mol Cell Biol 17:3408-3417, 1999) and 
Mundt et al. (Biochem. Biophys. Res. Commun. 239:377-385, 1997) observed that 
deletion of the C-terminus of Plkl increased the kinase activity -3 -fold while Jang 
et al (Jang et al., Proc. Natl. Acad. Sci. USA 99:1984-1989, 2002) found that the 
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isolated Plkl C-terminus interacts with and inhibits the activity of the isolated 
kinase domain towards the exogenous substrate casein. We observed the 
complementary result, namely that the kinase domain appears to inhibit 
phosphopeptide binding by the PBD. While the isolated Plkl PBD binds strongly 
5 and specifically to pSer/pThr-containing peptides (Figure 13 A), phosphopeptide 
binding by the PBD within full-length Plkl is reduced at least 10-fold, and is 
considerably less phospho-dependent (Figure 16A, wt lanes). The phospho- 
specific binding component of full-length Plkl is clearly mediated by the PBD 
(Figure 16A, compare wt pTP and TP lanes with H538A/K540M pTP and TP 

10 lanes). This suggested that a mutually inhibitory interaction exists between the 
Plkl PBD and the kinase domain in full-length Plkl. 

We wondered whether binding of the PBD to phosphopeptides was 
sufficient to relieve this intramolecular interaction and stimulate the activity of the 
kinase domain towards exogenous substrates. Baculovirally-produced Plkl was 

15 therefore incubated with either the optimal PBD phosphopeptide or its non- 

phosphorylated counterpart and kinase activity towards casein measured by SDS- 
PAGE/autoradiography . As shown in Figure 1 6B, addition of the optimal PBD 
phosphopeptide increased Plkl kinase activity by a factor of 2.6, while addition of 
the non-phosphorylated peptide had no effect. This result compares quite 

20 favourably with the - 2.5-fold stimulation of Src and Hck kinase activity that is 
observed when these full-length Src family kinases are incubated with their 
optimal SH2 -binding phosphotyrosine peptides to relieve SH2-mediated inhibition 
of the kinase domain (Liu et ah, Oncogene 8: 1 1 19-1 126, 1993; Moarefi et al, 
Nature 385:650-653, 1997). Thus, our results for Plkl suggested that binding of 

25 the PBD to primed phosphorylation sites not only serves to target the kinase 
domain to substrates but also simultaneously activates the kinase domain for 
substrate phosphorylation by relieving an inhibitory intramolecular interaction 
(Figure 16C). 
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In this study, we have elucidated a conserved phosphopeptide-binding 
motif that is recognized by the PBDs of all canonical members in the human Plk 
family, Xenopus Plxl and S. cerevesiae Cdc5p. The high-resolution X-ray 
structure of the Polo-box domain bound to an optimal phosphothreonine peptide, 
5 provides a molecular rationale for motif selection, defines a new protein fold, and 
illustrates a unique mechanism for phospho-dependent ligand binding involving 
the participation of ordered solvent molecules, together with a conserved His/Lys 
pincer motif. We have identified a pSer/Thr-dependent mechanism of Plk 
activation in which intramolecular inhibition of the kinase by the PBD is relieved 
10 by PBD interaction with pre-phosphorylated binding targets. 

Structural Definition of the Polo-box Domain: A General Phosphoprotein 
Recognition Module 

Previous reports have described the presence of 1-3 Polo-boxes within the 

15 C-terminal regions of Polo-like kinases (Glover et al., Genes Dev. 12:3777-3787, 
1998; Glover et al, J. Cell Biol 135:1681-1684, 1996; Nigg, Curr. Opin. Cell 
Biol 10:776-783, 1998; Seong et al, J. Biol Chem. 277:32282-32293, 2002). Our 
structure now definitively shows that the PBD consists of two structurally 
homologous regions corresponding to two conserved Polo-box sequences. 

20 Phosphopeptide binding occurs at the interface of the two Polo-boxes, 

rationalizing both the observed 1 : 1 stoichiometry of PBD/ligand binding (Figure 
5B) and the requirement for both Polo-boxes for efficient subcellular localization 
of Plkl in vivo (Seong et al, J. Biol Chem. 277:32282-32293, 2002). Polo-box 
Domains (PBDs) now join an expanding family of 

25 phosphoserine/phosphothreonine binding domains that includes 14-3-3 proteins, 
WW, FHA, WD40, and Smad MH2 domains (Yaffe et al, Curr Opin Cell Biol 
13:131-138, 2001; Yaffe et al. Structure 9:R33-38, 2001). In contrast to other 
more ubiquitous phosphodependent binding modules, PBDs occur only in Polo- 
like kinases where they localize Plks to specific subcellular organelles and mitotic 
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structures (Jang et al., 2002; Lee et al., Proc. Natl Acad. ScL USA 95:9301 -9306, 
1998; (Lee et al., Mol Cell Biol 17, 3408-3417, 1999) and target the kinase to 
substrates that have been primed by prior phosphorylation. 



5 Common Phosphopeptide Motif Selection by the PBD family 

In higher eukaryotes, different Plk family members function at different 
points in the cell cycle (Donaldson et al., 2001; Glover et ah, Genes Dev 12:3777- 
3787, 1998; Glover et al, J Cell Biol 135, 1681-1684, 1996; Ma et al, Mol 
Cancer Res 1, 376-384, 2003; Nigg, Curr Opin Cell Biol 10:776-783, 1998) or 

10 play antagonistic roles in response to DNA damage (Bahassi et al, Oncogene 21, 
6633-6640, 2002; Smits et al, Nat Cell Biol 2:672-676, 2000; Xie et al., Cell 
Cycle 1 :424-429, 2002). Given the similarity in the selected motifs with a Ser- 
pSer/pThr-Pro/X core for these three proteins, potential mechanisms to separate 
Plks within a single organism achieve substrate specificity might include different 

15 substrate selectivities by their respective kinase domains, spatially and temporally 
restricted activation of Plks by upstream kinases, or the well documented cell- 
cycle regulation of Plkl and 2 expression (Golsteyn et al., Cell Sci 107:1509-1517, 
1994; Lee et al, 1995; Ma et al, Mol Cancer Res 1 :376-384, 2003). One pathway 
in which such specificity must be vital is the DNA damage response, since Plkl is 

20 inhibited by DNA damage (Smits et al., Nat Cell Biol 2:672-676, 2000), while 
Plk3 appears to be activated (Xie et al., Cell Cycle 1:424-429, 2002). 

In addition to pThr-1 selectivity for serine, all PBDs that we have 
examined exhibit moderate specificity for proline at the pThr+1 position, 
emphasizing a central role for CDKs and other proline-directed kinases in priming 

25 substrates for Plkl targeting. Several lines of evidence support this model. For 
example, maximal Plkl -induced activation and nuclear translocation of Cdc25 has 
been shown to require cyclin B coexpression (Toyoshima-Morimoto et al., EMBO 
Rep. 3:341-348, 2002). Furthermore, full reconstitution of purified APC activity 
requires prior synergistic phosphorylation of the APC by both Cdc2 and Plkl 
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(Golan et al., J, Biol Chem. 277:15552-15557, 2002). Interestingly, the backbone 
torsion angles of the trans-pvoYme in the Plkl -bound phosphopeptide are very 
similar to those of the equivalent Pro residue in the ternary 
cyclinA3/CDK2/peptide complex structure (Brown et al., Nat Cell Biol 1:438- 
5 443, 1999). Thus, the conformation of the peptide in the PBD complex reflects 
not only the structural requirements for Plk interaction but also the requirements 
for the initial priming phosphorylation. 

Nevertheless, a clear tolerance for residues other than proline demonstrates 
that other mitotic kinases may also serve as priming agents. In this regard, the 
10 NIMA-related kinase Finl has been recently shown to increase Plol affinity for 
spindle pole bodies in S. pombe (Grallert et al., EMBOJ. 21:3096-3107, 2002). 
Identification of substrates for Plk family members, as well as the kinases 
involved in substrate priming is, therefore, important. 

1 5 The Structural Basis of Phosphopeptide Binding 

The PBD binds to phosphorylated epitopes in a way that is distinct from 
that observed previously in structures of other protein-phosphopeptide complexes 
(Yaffe et al., Structure 9:R33-38, 2001). These differences include the His/Lys 
pincer, a significant contribution from bridging water molecules and an unusual 

20 orientation of the pThr-1 residue that is directed toward the protein-binding 

surface. Although stereospecific, solvent-mediated binding has been described in 
other systems, 'solvent-bridged' interactions with the phosphoryl group have not 
been observed in any structures of protein-phosphopeptide complexes reported to 
date. Rather, the phospho moiety is always held by direct interactions, most often 

25 with highly conserved arginine side-chains (Eck et al., Nature 362:87-91, 1993; 
Waksman et al., Nature 358:646-653, 1992; Yaffe et al., Structure 9:R33-38, 
2001). The importance of the His/Lys pincer in the Plkl PBD structure is 
exemplified by our observations that its mutation abrogates phosphopeptide 
binding by the PBD in vitro, targeting of Plkl to Cdc25C in vivo, and centrosomal 
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localization, as well as disrupt the ability of the isolated PBD to induce G2/M 
arrest and aberrant spindle function. 

Structure-based sequence alignments (Figure 12B) show that the binding 
surface formed at the interface of the two Polo-boxes is the only totally conserved 
5 region in the PBD, further supporting our finding that the PBDs from different 
Plks generally select very similar optimal phosphopeptide binding motifs. Crucial 
hydrogen-bond interactions and van der Waals contacts with Trp-414 of Plkl 
rationalize both the strong serine selection at the (pThr/pSer)-l position and the 
fact that mutation of Trp-414 disrupts Plkl function in vivo (Lee et al., Proc. Natl. 
10 Acad. ScL USA 95:9301-9306, 1998). The absolute conservation of Trp-414 

predicts that all family members should exhibit the same serine preference, and we 
now show that this is the case. Historically, the 10 amino acid sequence 
surrounding Trp-414 was considered the signature motif for the non-catalytic 
region of Polo-family kinases (Golsteyn et al., Cell ScL 107:1509-1517, 1994). 

15 

Comparison of the Plkl PBD and Sak Polo-box Structures 

The Plkl PBD and Sak Polo-box structures emphasize how related 
sequence motifs are able to form markedly different protein folds. Significant 
structural differences between homologous proteins have been observed only 

20 rarely and most prominently in the KH family of small RNA-binding domains 
(Grishin, Nucleic Acids Res. 29:638-643, 2001 and references therein). In this 
case, two distinct sub-families of structures are distinguishable by different 
topologies of a and p secondary structural elements although all share a related 
hydrophobic core and similar overall tertiary structure. The differences between 

25 the Plkl PBD and Sak Polo-box are more extreme and emphasize how related 
sequence motifs are able to form markedly different protein folds. This, in turn, 
has considerable implications for both motif-based structure prediction and efforts 
to delineate biological function from structures of apparently homologous 
proteins. 
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How do these unexpected structural differences relate to PBD function in 
Plkl and Polo-box function in Sak subfamily Plks? The grossly different 
architectures argue against conservation of the phosphoprotein-binding function 
since residues most intimately involved in phosphopeptide binding by Plkl (e.g. 
5 His-538/Lys-540, Trp-414) are not conserved in Sak. Furthermore, examination 
of the electrostatic potential surface of the Sak Polo-box dimer shows no 
significant regions of positive charge (data not shown), a property otherwise 
common to phospho-dependent binding proteins. 

10 A Model for Phospholigand-Induced Stimulation of Plk Kinase Activity 

Two alternative models for intramolecular regulation of kinase activity by a 
phosphopeptide binding domain are exemplified by the mechanisms of SH2 
domain-mediated inhibition in Src family kinases and SHP-family tyrosine 
phosphatases. In the Src-type model, the phosphopeptide binding cleft of the SH2 

15 domain engages an internal phosphotyrosine motif at the C-terminus of the 
molecule to hold the kinase domain in an inactive conformation (Sicheri et al., 
Nature 385:602-609, 1997; Xu et al. Nature 385:595-602, 1997). We believe that 
Plkl does not operate through this mechanism since it does not possess an internal 
optimal PBD binding site, and interaction of the PBD with the Plkl kinase domain 

20 is not dependent on phosphorylation (Jang et al, Proc. Natl. Acad. ScL USA 

99:1984-1989, 2002). In fact, mutation of Thr-210 to Asp as a mimic of kinase 
activation loop phosphorylation, actually abolishes PBD binding (Jang et al, Proc. 
Natl. Acad. Sci. USA 99:1984-1989, 2002). Furthermore, mutation of Trp-414 in 
Polo-box 1 has been shown to have no effect on the basal level of Plkl kinase 

25 activity (Lee et al, Proc. Natl. Acad. Sci. USA 95:9301-9306, 1998). Since 

mutations at this position disrupt phosphodependent PBD interactions, it would 
seem that kinase regulation occurs through a phospho-independent binding 
function of the PBD. 
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In the SHP2 model, binding of the back surface of the N-terminal SH2 
domain to the phosphatase domain partially occludes the catalytic cleft and 
simultaneously deforms the SH2 domain's binding pocket to reduce its affinity for 
phosphopeptide ligands (Hof et al., Cell 92:441-450, 1998). This is entirely 
5 consistent with the reduced phosphopeptide binding that we observe for the PBD 
in the context of full-length Plkl (Figure 8A, 8C). In the case of SHP2, high local 
concentrations of phosphotyrosine ligands are able to bind to the N-terminal SH2 
domain, inducing a concomitant conformational rearrangement of the SH2 binding 
cleft that is transmitted to its phosphatase-interacting surface and releases the 

10 catalytically competent phosphatase domain. We believe Plks may be regulated 
by a related mechanism (Figure 8C). Some support for the SHP-like mechanism 
arises from our observation that the N-terminal Polo-box of one molecule in the 
crystallographic asymmetric unit that is not involved in extensive lattice contacts 
displays significantly higher temperature factors than its C-terminal counterpart 

15 (58A 2 vs 37A 2 ). This implies a rather dynamic association of the two Polo-boxes 
that is likely to be more pronounced in the absence of the phosphopeptide ligand. 
In our current model, binding of the phosphopeptide between the N- and C- 
terminal Polo motifs acts as a structural switch, stabilizing a conformation of the 
PBD that is inappropriate for association with the kinase domain. Subsequent 

20 T2 10D phosphorylation by upstream kinases would then serve to maintain the 

active state by preventing re-binding of the PBD to the kinase. Definitive proof of 
this mechanism will require the determination of structures of full-length Plk's and 
their complexes. This work is in progress. 

It is clear that proper mitotic progression requires the highly regulated 

25 interplay between CDK's and a variety of other proteins kinases such as Aurora, 
NIMA, and Polo-like kinases, yet the molecular events that underlie the activity of 
many of these enzymes are largely unknown. The results of our integrated 
biochemical, structural and cell-biological approach now provide a framework 
within which the cellular function of the Polo-box motif can be understood. Plkl 
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is overexpressed in a variety of human tumors (Strebhardt et al., JAMA 283:479- 
480, 2000; Takai et al., Cancer Lett. 169:41-49, 2001), and down-regulation of 
human Plkl has been shown to inhibit proliferation of cultured tumor cells (Elez et 
al., Biochem. Biophys. Res. Commun. 269:352-356, 2000; Liu et al., Proc. Natl. 
5 Acad. Sci. USA 100:5789-5794, 2003), suggesting that Plks are potentially 

important targets for therapeutic intervention. Here, we have shown that the Plkl 
PBD binds to phosphorylated epitopes in a way that is distinct from any observed 
previously in structures of other protein-phosphopeptide complexes. The unique 
pattern of interactions with the Ser-pThr dipeptide suggest this motif may be 
10 employed as a useful template for the design of anti-proliferative inhibitors 

specifically directed against Polo-box domains. The experiments described above 
were carried out using the following methods. 

Phospho-motif screen for phosphoserine/threonine binding domains 
15 A phospho-motif-biased peptide library and its unphosphorylated 

counterpart were constructed as follows: biotin-Z-Gly-Z-Gly-Gly-Ala-X-X-B-X- 
pThr-Pro-X-X-X-X-Ala-Lys-Lys-Lys SEQ ID NO:40 and biotin-Z-Gly-Z-Gly- 
Gly-Ala-X-X-B-X-Thr-Pro-X-X-X-X-Ala-Lys-Lys-Lys SEQ ID NO:41, where 
pThr is phosphothreonine, Z indicates aminohexanoic acid, X denotes all amino 
20 acids except Cys, and B is a biased mixture of the amino acids P, L, I, V, F, M, W. 
Streptavidin beads (Pierce, 75pmol /jliL gel) were incubated with a five-fold molar 
excess of each biotinylated library in 20 mM Tris/HCl (pH7.5), 125 mM NaCl, 
0.5% NP-40, 1 mM EDTA and washed four times with the same buffer to remove 
unbound ligand. The bead-immobilized libraries (30 \\L gel) were added to 6 |iL 
25 of an in vitro translated [ 35 S]-labeled protein pool in 200 binding buffer (20 
mM Tris/HCl (pH7.5), 125 mM NaCl, 0.5% NP-40, 1 mM EDTA, 1 mM DTT, 4 
jig/mL pepstatin, 4 |ug/mL aprotinin, 4 |ig/mL leupeptin, 200 fiM Na 3 V0 4 , 50 mM 
NaF). Each pool consisted of -30 radiolabeled proteins produced by coupled in 
vitro transcription/translation (Promega) of a plasmid pool containing -100 cDNA 
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clones from a unidirectional and oligo dT-primed human HeLa cell library in 
pCDNA3.1 (Kanai et ai, EMBOJ 19:6778-6791, 2000). After incubation at 4°C 
for 2-3 hours, the beads were rapidly washed four times with binding buffer prior 
to separation on SDS-PAGE (1 1.4%) and autoradiography. Positively scoring hits 
5 within pools were recognized as protein bands that interacted more strongly with 
the phosphorylated immobilized library than its unphosphorylated counterpart. 
Pools containing positively scoring clones were progressively subdivided using a 
96-well format and re-screened for phospho-binding until single clones were 
isolated and identified by DNA sequencing. 

10 

Cloning, expression, and purification of Plk-1 PBD proteins 
For deletion mapping of the PBD, C-terminal fragments of Plk-1 were 
generated by PCR and cloned into the EcoRI and Xhol sites of pCDNA3.1 
(Invitrogen). For production of recombinant PBD as a GST fusion in bacteria, the 

15 326-603 fragment of Plk-1 was ligated into the EcoRI and Xhol sites of pGEX-4T 
(Pharmacia), transformed into BL21, and induced in late log-phase cells at 37°C 
for 3.5 hours in the presence of 0.4 mM IPTG. For measurements of peptide 
binding affinity by ITC, GST-Plk-1 (326-603) was isolated from bacterial lysates 
using glutathione agarose, cleaved from GST using thrombin (lOU/mL), and 

20 purified by anion exchange chromatography (Q Sepharose HP, Pharmacia). 

Peptide Library Screening 

Phosphothreonine- and phosphoserine-oriented degenerate peptide libraries 
containing the sequences Met-Ala-X-X-X-X-pThr-Pro-X-X-X-X-Ala-Lys-Lys- 
25 Lys SEQ ID NO:42 (theoretical degeneracy (td) =1.7 x 10 10 ), Met-Ala-X-X-X-X- 
pThr-X-X-X-X-Ala-Lys-Lys-Lys SEQ ID NO:43 (td = 1.7 x 10 10 ), Met-Ala-X-X- 
X-X-Ser-pThr-X-X-X-X-Ala-Lys-Lys-Lys SEQ ID NO:44(td = 1.7 x 10 10 ), Met- 
Ala-X-X-X-pSer-Pro-X-X-X-Ala-Lys-Lys-Lys SEQ ID NO:45 (td = 4.7 x 10 7 ), 
Met-Ala-X-X-X-X-pSer-X-X-X-X-Ala-Lys-Lys-Lys SEQ ID NO:46 (td = 1.7 x 
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10 ,u ), and Met-Ala-X-X-X-X-Ser-pSer-X-X-X-X-Ala-Lys-Lys-Lys SEQ ID 
NO:47(td= 1.7 x 10 10 ) were synthesized using N-oc-FMOC-protected amino acids 
and standard BOP/HOBt coupling chemistry. Peptide library screening was 
performed using 100 ^il of glutathione beads containing saturating amounts of 
5 GST-Plk-1 (residues 326-603) fusion protein (-1-1.5 mg) as described in Yaffe & 
Cantley (Methods EnzymoL, 328:157-170, 2000). Beads were packed in a 1 mL 
column and incubated with 0.5 mg of the peptide library mixture for 10 minutes at 
room temperature in PBS (150 mM NaCl, 3 mM KC1 3 10 mM Na 2 HP0 4 , 2 mM 
KH 2 P0 4 , pH 7.2). Unbound peptides were removed from the column by two rapid 

10 washes with PBS containing 0.5% NP-40 and two subsequent washes with PBS. 
Bound peptides were eluted with 30% acetic acid for 10 minutes at room 
temperature, lyophilized, resuspended in H 2 0, and sequenced by automated 
Edman degradation on a Procise protein microsequencer. Selectivity values for 
each amino acid were determined by comparing the relative abundance (Mole 

15 percentage) of each amino acid at a particular sequencing cycle in the recovered 
peptides to that of each amino acid in the original peptide library mixture at the 
same position. 

Isothermal Titration Calorimetry 

20 Peptides were synthesized by solid phase technique with two C-terminal 

lysines to enhance solubility, purified by reverse phase HPLC following 
deprotection, and confirmed by MALDI-TOF 9 Matrix-assisted laser 
desorption/ionisation-time of flight mass spectrometry. Some peptides contained 
an additional tyrosine residue to facilitate concentration determination by optical 

25 absorbance. Calorimetry measurements were performed using a VP-ITC 

microcalorimeter (MicroCal Inc., Studio City, CA). Experiments involved IO^iL 
injections of peptide solutions (150 |iM-180 \iM) into a sample cell containing 
\5\lM Plk-1 PBD (residues 326-603) in 50 mM Tris/HCl (pH 8.1), -200 mM NaCl, 
2 mM TCEP. Thirty injections were performed with a spacing of 240 s and a 
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reference power of 25 |iCal/s. Binding isotherms were plotted and analyzed using 
Origin Software (MicroCal Inc. Studio City, CA). 

Plk-1 PBD binding to cellular substrates 
5 HeLa cells were arrested in interphase or G2/M by treatment with 

aphidicolin (5 |ig/mL) or nocodazole (50 ng/mL), respectively, for 16 hours. Cells 
were lysed in 25 mM Tris/HCl (pH 7.5) containing 125 mM NaCl, 0.5% NP-40, 5 
mM EDTA, 2 mM DTT, 4 ^ig/mL pepstatin, 4 ng/mL aprotinin, 4 ^ig/mL 
leupeptin, 1 mM Na 3 V0 4 , 50 mM NaF 5 and 1 jliM microcystin, and 1 50 ^gs of 

10 lysate incubated with 10 |iL of glutathione agarose beads containing 2-5 jig of 
GST-Plk-1 (residues 326-603), GST-Pin 1, or GST for 30 minutes at 4°C. Beads 
were washed four times with lysis buffer. Precipitated proteins were eluted in 
sample buffer and detected by blotting with monoclonal MPM-2 (Upstate 
Biotechnology, Inc.) or polyclonal anti-Cdc25C (Santa Cruz Biotechnology, Santa 

15 Cruz, California). For peptide competition experiments, GST-Plk-1 (residues 326- 
603) was immobilized on glutathionine beads and preincubated with 320 jiM of 
PoloBoxtide-optimal, -8T, or -7V for 45 minutes at 4°C. For binding experiments 
involving mutant cdc25C, HeLa cells were transfected with wild-type and mutated 
versions of HA-tagged Cdc25C in pECE using Superfect (Qiagen, Valencia, CA). 

20 Nocodazole (50 ng/mL) was added seventeen hours after transfection and cells 
incubated for an additional 14 hours to arrest them in G2/M. Point mutations of 
Cdc25C were constructed using the QuickChange site-directed mutagenesis 
system (Stratagene) and verified by DNA sequencing. 

25 Centrosomal localization of the Plk-1 PBD 

U20S cells were cultured in 8-well chamber slides and arrested at G2/M by 
treatment with nocodazole (50 ng/mL) for 14 hours. After rinsing with PBS, cells 
were incubated with 4 \iM GST-Plk-1 PBD (residues 326-603) and Streptolysin-0 
(1 U/ml) in permeabilization buffer (25 mM HEPES (pH 7.9), 100 mM KC1, 3 
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mM NaCl, 200 mM sucrose, 20 mM NaF, 1 mM NaOV0 4 ) for 20 minutes at 
37°C. Cells were fixed in 3% paraformaldehyde/2% sucrose for 10 minutes at 
room temperature and extracted with a 0.5% Triton X-100 solution containing 20 
mM Tris-HCl (pH 7.4), 50 mM NaCl, 300 mM sucrose, and 3 mM MgCl 2 for 10 
5 minutes at RT. Slides were stained with Alexa Fluor 488-conjugated anti-GST 
(Molecular Probes, Eugene, OR) and monoclonal anti-y-tubulin (Sigma, St.Louis, 
MO) antibodies at 4°C overnight, then stained with a Texas Red conjugated anti- 
mouse secondary antibody for 60 minutes at room temperature and counterstained 
with 4 |ig/ml DAPI. Cells were examined using a Nikon Eclipse E600 

10 fluorescence microscope equipped with a SPOT RTcamera and software 

(Diagnostic Instruments, Livingston, Scotland). Images were analyzed using NIH 
Image. For peptide competition experiments, the GST-Plk-1 PBD solution was 
preincubated with 250 jiM of its optimal phosphopeptide ligand (PoloBoxtide- 
optimal) or its unphosphorylated counterpart (PoloBoxtide-8T) for 15 minutes at 

1 5 room temperature prior to use. 

To quantitate centrosomal localization of the GST-Plk-1 PBD relative to y- 
tubulin, black and white images of single cells showing comparable overall 
intensity for Alexa Fluor and Texas Red were selected and scaled to an average 
grayscale value of 200 (1= white, 255=black). The normalized intensity of 

20 centrosome-specific Alexa Fluor 488 staining (N.I. A F488) or Texas Red staining 
(NJ.tr) above background was defined as ([I ce ntrosome-Iceii]/Iceii) where I cen trosome 
indicates the fluorescence intensity of either Alexa-Fluor 488 or Texas Red 
averaged over the centrosome and I cen indicates the overall fluorescence intensity 
averaged over the entire cell. The relative GST-PBD/y-tubulin specific staining 

25 was then calculated as N J.af48s/N.Ltr- 

Screens to Identify Novel Binding Pairs 

Novel binding pairs can be identified by the methods of the invention. For 
example, phosphopeptides are generated that are biased to include MAP kinase 
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and Cell-cycle dependent kinase (Cdks) consensus phosphorylation sites (i.e., 
pSer-Pro), for use in screening for novel pSer-Pro binding polypeptides. Such a 
screen can be easily adapted to identify additional binding pairs. By taking 
advantage of the observation that protein kinases and phosphopeptide binding 
5 domains appear to co-evolve to recognize overlapping sequence motifs, 

phosphopeptides can be generated to follow specific protein kinase substrates. 
Thus, basophilic phosphopeptides having a core sequence including 
RXRSX[pS/pT] (where R is arginine, pS is phosphoserine, pT is 
phosphothreonine, and X is any amino acid) can be used to identify novel binding 

10 partners dependent on the kinase, Akt. Other potential basophilic kinase 

substrates based on consensus phosphorylation sequences of protein kinase C 
(PKC), cAMP-dependent protein kinase (PKA), G-protein coupled receptor 
kinases such as 0-ARK may also be used. 

Several methods are known in the art to identify consensus kinase 

15 substrates, for example, in U.S.P.N. 5,532,167, U.S.P.N. 6,004,757, and WO 
98/54577. Thus, degenerate phosphopeptides can be generated based on 
consensus kinase substrate peptide motifs. Exemplary kinase substrate peptide 
motifs that can be used include, without limitation, phosphopeptides derived from 
the consensus sequences of the serine/threonine kinases, Ca 2+ /calmodulin 

20 dependent kinases (CaMKs), check point kinases (e.g. CHK, Rad53), myosin light 
chain kinases, DRAK, Trio, casein kinase 1, cell cycle dependent kinases (CDKs, 
e.g., Cdc2, Cdk4, Cdk6), glycogen synthase kinases (GSK), MAP kinases (e.g., 
Jnk, Erk, p38), STE family kinases (e.g., PAK, GCK/MAP4K), MAP kinase 
activated kinases (e.g., Mnk), eIF2cx kinases (e.g., PERK, PKR, HRI, GCN2), Raf 

25 kinases (e.g., A-Raf, B-Raf), casein kinase II, aurora/Polo kinases, mixed lineage 
kinases (e.g., MLK1, -2, -3), AKAP, Activin-receptor like kinase (Kir4), CAK, 
Mos, Pirn, and Ksr. Other kinase substrate-derived phosphopeptide sequences that 
can be used in the invention include those derived from the dual specificity 
kinases, WEE-1, MEKs, DYRKs, Tesk, Clk, HIPK, Mps-1, TSK, and C-TAK. 
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Dual specificity kinases also include polypeptides related to the lipid kinases 
FRAP, pi 10 PI3 Kinase, ATM, ATR, and DNA-PK. 

Protein tyrosine kinase substrate peptide motifs can also be used in the 
invention and include phosphopeptides derived from the consensus substrate 
5 sequences of the receptor tyrosine kinases, which include the EGF-R family (e.g., 
EGF-R, Her2/Neu), PDGF-R, CSF-R, IGF-R, VEGF-R (e.g., Flk/Kdr, Fit), HGF- 
R (Met), NGF-R (e.g., TrkA, -B, -C), FGF-R, ROR, Tie-1, Tie-2/Tek, Eph (e.g., 
EphA]. 8 , EphBj.e), Rik, Ron, Ros, Ret, and from the cytoplasmic tyrosine kinases, 
which include, the Src family (e.g., Src, Lck, Lyn, Fyn, Hck, Yes), Abl, Csk, 

10 CTK, JAKs, FAK, ITK, BTK, Ack/Pyk, Tec, Tyk, Syk, Zap70, Fer, and Fes/Fps. 
Binding pairs identified are not limited to those that include 
phosphopeptide binding domains. The methods of the invention may be used to 
identify virtually any peptide-binding domain in which the domain is identified by 
simultaneous screening of a protein/polypeptide expression library with a biased 

15 peptide library. For example, a screen for binding pairs is carried out to identify a 
peptide-binding domain, for example, a PDZ, SH3, or WW peptide binding 
domain. The "bait" peptide library contains a degenerate collection of peptides 
oriented around at least two or more fixed residues. A working example of such a 
screen is provided in the upper left panel of Figure 9B, where there is a band at 

20 -24 kDa that binds the non-phosphopeptide library but not the phosphopeptide 
library., suggesting that it is specific for binding to BxTP motifs. 

Cloning and Expression of PBD Proteins 

C-terminal fragments of human Plkl (residues 326-603), human Plk2 
25 (residues 355-685), human Plk3 (residues 335-646), Xenopus Plxl (residues 317- 
598), and Saccharomyces cerevesiae Cdc5p (residues 357-705) were amplified 
from IMAGE cDNA clones or directly from S. cerevisiae chromosomal DNA by 
PCR and ligated into suitably digested pGEX4T-3 or pGEX-6Pl (Pharmacia). 
Proteins were expressed in E. coli BL21(DE3) cells and purified by glutathione- 
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affinity chromatography. For measurements of peptide binding affinity and 
domain mapping experiments, proteins were cleaved from GST with either 
thrombin or viral protease 3C (Pharmacia-LKB, Peapack, NJ) and further purified 
by anion exchange chromatography (Q Sepharose HP, Pharmacia) or gel filtration 
5 (Superdex S-75, Pharmacia, Peapack, NJ). 

Oriented Peptide Library Screening 

Phosphothreonine-oriented degenerate peptide libraries containing the 
sequences Met-Ala-X-X-X-X-pThr-Pro-X-X-X-X-Ala-Lys-Lys-Lys SEQ ID 

1 0 NO:48 (theoretical degeneracy (td) = 1.7 x 1 0 10 ) and Met-Ala-X-X-X-X-Ser-pThr- 
X-X-X-X-Ala-Lys-Lys-Lys SEQ ID NO:49 (td = 1.7 x 10 10 ) were synthesized 
using N-a-FMOC-protected amino acids and standard BOP/HOBt coupling 
chemistry. Peptide library screening was performed using 100 )^1 of glutathione 
beads containing saturating amounts (~1-1.5 mg) of GST-hPlkl , GST-hPlk2, 

15 GST-hPlk3, GST-Plxl, or GST-Cdc5p as described previously (Yaffe et al, 
Methods Enzymol 328:157-170, 2000). 

Peptide Binding Measurements 

Peptides were synthesized by solid phase technique with two C-terminal 

20 lysines to enhance solubility. Some peptides contained an additional tyrosine 

residue to facilitate concentration determination by optical absorbance. Isothermal 
titration calorimetry was performed using a VP-ITC microcalorimeter (MicroCal 
Inc. Studio City, CA) by titration of 15-40|iM solutions of PBD proteins with 30 x 
10 \i\ injections of 150-400jiM peptide in a starting volume of 1.4-2.0 ml. Binding 

25 isotherms were plotted and analyzed using Origin Software (MicroCal Inc. Studio 
City, CA). Binding of in vitro translated Plkl PBD (wild type and mutants) to 
bead-immobilized pTP and TP peptide libraries was performed as described 
previously (Elia et al., Science 299:1228-1231, 2003). pTP and TP indicate the 
peptide libraries biotin-Z-Gly-Z-Gly-Gly-Ala-X-X-B-X-pThr-Pro-X-X-X-X-Ala- 
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Lys-Lys-Lys SEQ ID NO:50 biotin-Z-Gly-Z-Gly-Gly-Ala-X-X-B-X-Thr-Pro-X- 
X-X-X-Ala-Lys-Lys-Lys SEQ ID NO:51 5 respectively, where pThr is 
phosphothreonine, Z is aminohexanoic acid, X denotes all amino acids except Cys, 
and B is a biased mixture of the amino acids P, L, I, V, F, M 3 W. 

5 

Peptide Spot Array 

An ABIMED peptide arrayer with a computer controlled Gilson diluter and 
liquid handling robot was used to synthesize peptides onto an amino-PEG 
cellulose membrane using N-ot-FMOC-protected amino acids and DIC/HOBT 

10 coupling chemistry. The membrane was blocked in 5% milk/TBS-T (0.1%) for 2 
hours at room temperature, incubated with 0.1 ^iM GST-Plkl PBD (residues 326- 
603) in 5% milk, 50 mM Tris/HCl (pH 7.5), 150 mM NaCl, 2 mM EDTA, 2mM 
DTT for 1 hour at room temperature and washed with TBS-T (0.1%). It was then 
incubated with anti-GST conjugated HRP in 5% milk/ TBS-T (0.1%) for 1 hour at 

15 room temperature, washed with TBS-T (0.1%), and subjected to 
chemiluminescence. 

Domain Mapping and Protein Purification 

Limited proteolysis of Plkl (residues 326-603) and Cdc5p were performed 

20 using trypsin or endoproteinase Glu-C (Promega). N- and C-terminal limits were 
determined by Edman sequencing and electrospray mass spectrometry. DNA 
sequences encoding the proteolytically-defined domains were amplified by PCR 
and cloned into pGEX-6Pl (Cdc5p) or a version modified to allow ligation- 
independent cloning that also permits fusion-protein cleavage with TEV protease 

25 (Stols et al., Pro. Expr. Purif. 25:8-15 2002) (SJS - unpublished data). 
Recombinant PBDs were then expressed and purified as above. 
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Crystallization and Structure Determination 

For crystallization, the phosphopeptide MAGPMQSpTPLNGAYKK (SEQ 
ID NO:52) was mixed with the Plkl PBD fragment in a 1.5:1 stoichiometric 
excess and concentrated to -0.2 mM in a buffer containing 20mM Tris.HCl pH 
5 8.0/500mM NaCl, ImM EDTA, 3mM DTT. Crystals were grown by microbatch 
methods at 18°C using a Douglas Instruments IMP AX 1-5 crystallization robot 
and belong to monoclinic space-group P2j (a=62.4A, b=79.5A, c=62.0A, 
P=93.26°) with two complexes per asymmetric unit. Native data were collected 
on Station 14.1 at the SRS Daresbury using cryopreserved crystals at a 

10 temperature of 100°K. All data were reduced using the HKL suite of processing 
software (Otwinowski et al., Meth. Enzymol. 276:307-326, 1997). Phase 
information was derived from a three wavelength MAD experiment, using a single 
crystal of Se-methionine substituted PBD in complex with the phosphopeptide. 
Data for each wavelength were collected to a nominal 3.0A spacing on Station 

15 14.2 at the SRS, Daresbury, UK. Ten Se sites corresponding to five sites per 
monomer in the asymmetric unit were located, and the phases refined using 
SOLVE (Terwilliger et al, Acta Crystallogr. D. Biol Crystallogr 55:849-861, 
1999). Phases were extended to -2. 5 A against the native data using real-space 
non-crystallographic symmetry averaging with solvent flattening in RESOLVE 

20 (Terwilliger et al, Acta Crystallogr. D. Biol. Crystallogr 55:849-861, 1999). 
These maps were readily interpretable allowing a partial model of the PBD, 
together with seven residues of the phosphopeptide to be built using 'O' (Jones et 
al, Acta Crystallogr. A 47: 1 10-1 19, 1991). Subsequent refinement using native 
data to 1.9 A was carried out using CNS (Brunger et al, Acta Crystallogr. D Biol 

25 Crystallogr. 54:905-921, 1998) and REFMAC 5.0-ARP/wARP from the CCP4 
suite. A summary of statistics for the structure solution and refinement are shown 
in Table 5. Residues in bold: His538, Lys540, Trp414, and Leu491. 
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Figures were produced with Ribbons (Carson, J. Appl. Crystallogr. 24:958- 

961, 

1991) or SPOCK. 

Plkl PBD Binding to Cellular Substrates 

HeLa cells were transfected with His/Xpress-tagged Plkl (residues 326-603 
or 326-506) or myc-tagged Plkl (full-length). They were allowed to recover for 
17 hours and then arrested in G2/M by treatment with nocodazole (50 ng/mL) for 
14 hours. Cells were lysed in 25 mM Tris/HCl (pH7.5) containing 125 mM NaCl, 
0.5% NP-40, 5 mM EDTA, 2 mM DTT, 4 ug/mL pepstatin, 4 ug/mL aprotinin, 4 
ug/mL leupeptin, 1 mM Na 3 V0 4 , 50 mM NaF, and 1 uM microcystin. Lysates 

94- 

were incubated with 5 jiL Ni beads or 5 |iL a-myc-conjugated beads (Santa Cruz 
Biotechnology) for 90 minutes at 4°C. Beads were washed four times with lysis 
buffer. Precipitated proteins were eluted in sample buffer and detected by blotting 
with polyclonal anti-Cdc25C (Santa Cruz Biotechnology). Point mutations of 
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Plkl were constructed using the QuickChange site-directed mutagenesis system 
(Stratagene, La Jolla, CA) and verified by DNA sequencing. 

Centrosomal Localization of the Plkl PBD 

5 U20S cells were cultured in 8-well chamber slides and arrested in G2/M by 

treatment with nocodazole (50 ng/mL) for 14 hours. After rinsing with PBS 5 cells 
were incubated with 4 \iM GST-Plkl PBD (residues 326-603) and Streptolysin-0 
(1 U/ml) in permeabilization buffer (25 mM HEPES (pH 7.9), 100 mM KC1, 3 
mM NaCl, 200 mM sucrose, 20 mM NaF, 1 mM NaOV0 4 ) for 20 minutes at 

10 37°C. Cells were fixed in 3% paraformaldehyde/2% sucrose for 10 minutes at 
room temperature and extracted with a 0.5% Triton X-100 solution cpntaining 20 
mM Tris-HCl (pH 7.4), 50 mM NaCl, 300 mM sucrose, and 3 mM MgCl 2 for 10 
minutes at Room temperature. Slides were stained with Alexa Fluor 488- 
conjugated anti-GST (Molecular Probes, Eugene, OR) and monoclonal anti-y- 

15 tubulin (Sigma) antibodies at 4°C overnight, then stained with a Texas Red 

conjugated anti-mouse secondary antibody for 60 minutes at room temperature 
and counterstained with 4 jug/ml DAPI. Cells were examined using a Nikon 
Eclipse E600 fluorescence microscope equipped with a SPOT RT camera and 
software (Diagnostic Instruments Livingston, Scotland). Images were analyzed 

20 using NIH Image. 

Cell Cycle Analysis 

HeLa cells were transfected with wild-type and mutant forms of GFP- 
tagged Plkl (residues 326-603) for 32 hours. Media containing floating cells was 
25 retained, and attached cells were released from plates by trypsinization. The two 
cell populations were combined, washed with PBS, and stained with Hoechst 
33342 (lOjig/mL) for 30 minutes at 37°C in DMEM/ 1 0%FB S (lxlO 6 cells/mL). 
Dead cells were stained by incubation with propridium iodide (5 jig/mL) for 5 
minutes at 4°C. GFP, Hoechst 33342, and propidium iodide fluorescent signals 
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were quantitated on a FAC Star Plus (Becton Dickinson, Franklin Lakes, N J) cell 
sorting machine using Cell Quest software. Cell cycle analysis of the total live 
cell population (no propidium iodide staining) and live GFP-expressing cells (no 
propidium staining and GFP positive) was performed using Modfit 2.0. 

5 

Plkl Kinase Assays 

SF9 cells infected with baculoviral GST-Plkl (full-length) were lysed in 20 
mM Hepes/KOH (pH 7.5), 135 mM NaCl, 1% NP40, 5 mM EGTA, 5^iM p- 
mercaptoethanol, 35 mM NaF, 0.5 mM Na 3 V0 4 , 20 mM (3-glycerolphosphate, 3 

10 ^iM microcystin, 1 jiM okadaic acid, 10 (ig/mL pepstatin, 10 |ig/mL leupeptin, and 
10 ng/mL aprotinin. Lysates were incubated for 2 hours at 4°C with glutathione 
beads, which were subsequently washed five times with 20 mM Hepes/KOH (pH 
7.5), 415 mM NaCl, 0.1% CHAPS, 5 mM EGTA, 5^M p-mercaptoethanol, 35 
mM NaF, and 0.5 mM Na 3 V0 4 at 4°C. Bound proteins were eluted with a buffer 

15 containing 30 mM glutathione, 50 mM Hepes/KOH (pH 8.0), 25mM NaCl, 2mM 
MgCl 2 , ImM EGTA, and 5\xM (3-mercaptoethanol and dialyzed against lOmM 
Hepes, lOmM NaCl, ImM EGTA, ImM DTT for 3 hours at 4°C. Kinase reactions 
were performed in 20 mM Hepes/KOH (pH7.5), 15 mM KC1, 10 mM MgCl 2 , 1 
mM EGTA, 100 \xM ATP, 5^iCi y-[ 32 P]-ATP, 1 mM DTT, and 0.1 |ug/|ugL casein 

20 for 15 minutes at 30°C. Reaction aliquots were removed at various time points, 
added to sample buffer, and boiled to arrest phosphorylation. P-incorporation 
into casein was determined by SDS-PAGE electrophoresis, autoradiography, and 
densitometry using ImageQuant software (Molecular Dynamics). For peptide 
activation experiments, 250 jiM of the PBD optimal phosphopeptide 

25 (MAGPMQSpTPLNGAKK) or its non-phosphorylated counterpart 

(MAGPMQSTPLNGAKK) were pre-incubated with GST-Plkl for 5 minutes at 
room temperature. 
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Molecular Modeling in silico 

The present invention provides an exemplary crystallized PBD- 
phosphopeptide complex and the atomic structural coordinates of this complex. 
The key structural features of the complex, particularly the shape of the substrate 
5 binding site, are useful in methods for designing or identifying selective inhibitors 
of a Polo-like kinase polypeptide, such as Plk-1, and in solving the structures of 
other proteins with similar features. The structure coordinates of this complex are 
encoded in a data storage medium, submitted herewith, for use with a computer 
for graphical three-dimensional representation of the structure and for computer- 

10 aided molecular design of new inhibitors. The differences in three-dimensional 
structure between PLK-1 and related proteins with known structures can be used 
to optimize selectivity of an inhibitor for PBD. In addition to the structural 
differences described herein, other differences between Plk-1 and other proteins 
can also be identified by a skilled artisan. 

15 The three-dimensional atomic structures reported herein can be readily used 

as a template for selecting potent inhibitors, such as small molecules or 
peptidomimetics that are designed to "fit" into the binding interface. Methods for 
designing peptidomimetics using rational drug design are known to the skilled 
artisan, and are described, for example, in U.S. Patent Nos: 6,225,076; 6,171,804; 

20 and in Han et al (Bioorg Med Chem, Lett, 10:39-43, 2000). Peptidomimetics 
capable of inhibiting complex formation can be identified, for example, through 
the use of computer modeling using a docking program such as GRAM, DOCK, 
or AUTODOCK (Dunbrack et al., Folding & Design, 2:27-42, 1997). This 
procedure can include computer fitting of candidate compounds to a the binding 

25 interface of a particular polypeptide to determine whether the shape and chemical 
structure of the potential ligand will allow it to bind within the structure of the 
polypeptide. Many methods can be used for this purpose such as, but not limited 
to, fast shape matching (Dock [Kuntz et al., J. Mol Biol, 161:269-288, 1982]; 
Eudock [Perola et al., J. Med, Chem,, 43:401-408, 2000]), incremental 
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construction (FlexX [Rarey et al., J Mol Biol, 261, 470-89, 1996]; 
HAMMERHEAD [Welch et al., Chem. Biol, 3, 449-462, 1996]), TABU search 
(Pro_Leads [Baxter et al., Proteins 33:367-382, 1998]; SFDock [Hou et al., 
Protein Eng. 12:639-647, 1999]), genetic algorithms (GOLD [Gold et al., J. Mol 
5 Biol 267:727-748, 1997]; AutoDock 3.0 [Morris et al., J. Comput. Chem., 

19:1639-1662, 1998]; Gambler [Charifson et al., J. Med. Chem., 42:5100-5109, 
1999]), evolutionary programming [Gehlhaar et al., Chem. Biol, 2:317-324, 
1995], simulated annealing (AutoDock 2.4 [Goodsell et al., Proteins, 8:195-202, 
1990]), Monte Carlo simulations (MCDock [Liu et al., J. Comput. -Aided Mol 

10 Des., 13:435-451, 1999]; QXP [McMartin et al., J. Comput. -Aided Mol. Des., 
11:333-344, 1997]), and distance geometry (Dockit [Metaphorics LLC, Piemont, 
CA 9461 1 www.metaphorics.com ]). 

Those skilled in the art can readily identify many small molecules or 
fragments as hits. If desired, one can link the different functional groups or small 

15 molecules identified by the above procedure into a single, larger molecule. The 
resulting molecule is likely to be more potent and have higher specificity. The 
affinity and/or specificity of a hit can also be improved by adding more atoms or 
fragments that will interact with the target protein. The originally defined target 
site can be readily expanded to allow further necessary extension. Selected 

20 compounds may be systematically modified by computer modeling programs to 
identify peptidomimetics having the greatest therapeutic potential. Alternatively, 
candidate compounds are selected from chemical libraries, or are synthesized de 
novo. 

The structural analysis disclosed herein in conjunction with computer 
25 modeling allows the selection of a finite number of rational chemical 

modifications. Thus, using the complex structure disclosed herein and computer 
modeling, a large number of candidate compounds can be rapidly screened in 
silico, and the most promising candidates can be identified. Candidate 
compounds, such as peptidomimetics, are then verified in vitro or in vivo, for 
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example, by determining the effect of the candidate compound on 
PBD/phosphopeptide binding, Polo-like kinase biological activity, cell cycle 
regulation, apoptosis, or cell proliferation. 



5 pSer/pThr-binding domains function in the cellular response to genotoxic 
stress 

Signal transduction by protein kinases in eukaryotes results in the directed 
assembly of multi-protein complexes at specific locations within the cell (Pawson 
et al., Science 300:445-52, 2003). This process is particularly evident following 

10 DNA damage, where activation of DNA damage kinases results in the formation 
of protein-protein complexes at discrete foci within the nucleus (Zhou et al., 
Nature 408:433-9, 2000). 

In many cases, kinases directly control the formation of these multi-protein 
complexes by generating specific phosphorylated-motif sequences; modular 

15 binding domains then recognize these short phospho-motifs to mediate protein- 
protein interactions. The first phosphopeptide-binding modules that were 
recognized, SH2 and PTB domains, bind specificially to pTyr-containing 
sequences (Pawson et al., Science 278:2075-80, 1997; Kuriyan et al., Annu Rev 
Biophys Biomol Struct 26:259-88, 1997;Yaffe, Nat Rev Mol Cell Biol 3: 177-86, 

20 2002). As detailed above, a number of modular domains that specifically 
recognize short pSer/pThr-containing sequences have now been identified, 
including 14-3-3 proteins, WW domains, FHA domains, and the C-terminal 
domain of Polo-like kinases (Yaffe et al., Structure 9:R33-8, 2001; Yaffe et al., 
Curr Opin Cell Biol 13:131-8, 2001; Elia et al., Science 299:1228-31, 2003). All 

25 of these pSer/pThr-binding domains participate in cell cycle regulation and the 
cellular response to genotoxic stress. 
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The PTIP tandem C-terminal BRCT pair is necessary and sufficient for 
phospho-specific binding 

Using the proteomic screening approach (Elia et al., Science 299:1228-31, 
2003). described herein, we have now identified novel modular pSer/pThr-binding 
5 domains involved in the DNA damage response. Following y-irradiation, 

phosphoinositide-like kinases including ATM/ATR and DNA-PK phosphorylate 
transcription factors, DNA repair proteins, protein kinases and scaffolds on Ser- 
Gln and Thr-Gln motifs (Abraham, Genes Dev 15:2177-96, 2001). We therefore 
constructed an oriented peptide library biased to resemble the (pSer or pThr)-Gln 

10 motif generated by ATM and ATR (Kim et al., J Biol Chem 274:37538-43, 1999; 
O'Neill Qt^JBiolChem 275:22719-27, 2000). (Figure 17A legend). An 
immobilized form of this library was used in an interaction screen against a library 
of proteins produced by in vitro expression cloning (Lustig et al, Methods 
Enzymol 283:83-99, 1997). The amino acids Arg, Lys, and His were intentionally 

15 omitted from the degenerate positions in the peptide library to decrease the 

likelihood of identifying phosphopeptide-binding domains such as 14-3-3, which 
target basophilic motifs generated by kinases such as AKT, PKA, and PKCs. To 
control for phosphorylation-independent binding, an identical peptide library was 
constructed with (Ser or Thr)-Gln substituted for (pSer or pThr)-Gln. 

20 The phosphorylated and non-phosphorylated peptide libraries were 

immobilized on streptavidin beads, and screened against approximately 96,000 in 
vitro translated (IVT) polypeptides (960 pools each encoding ~ 100 transcripts) 
over a 10 week period using a high-throughput approach. The majority of IVT 
products either failed to bind to either of the immobilized peptide libraries or 

25 bound slightly better to the non-phosphorylated control (Figure 17A). Several 

pools were found to contain cDNAs encoding proteins which bound preferentially 
to the (pSer or pThr)-Gln library. Pool EE1 1 contained the strongest 
phosphopeptide-binding clone, EE 1 1-9, which when sib-selected, was found to 
encode the C-terminal 70% of the human Pax2 /m^-activation domain-interacting 
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protein (PTIP) (Figure 17B) (Lechner et al., Nucleic Acids Res 28:2741-51, 2000; 
Cho et al., Mol Cell Biol 23:1666-73, 2003). Originally identified in a yeast 2- 
hybrid screen using Pax2 as bait (Lechner et al., Nucleic Acids Res 28:2741-51, 
2000), PTIP appears to play a critical role in the DNA damage response pathway 
5 (Cho et ah, Mol Cell Biol 23:1666-73, 2003), as well as in facilitating 

transcriptional responses downstream of TGF-P-Smad2 signaling (Shimizu et al., 
Mol Cell Biol 21:3901-12, 2001). 

Full-length PTIP transcripts also displayed preferential binding to (pSer or 
pThr)-Gln peptides, though the differential binding was somewhat less 

10 pronounced, suggesting that the C-terminal fragment of PTIP likely contains a 
discrete phosphopeptide binding module. In addition to its Gin-rich region, 
human PTIP contains 4 BRCT domains, which are known protein-protein 
interaction modules present in many DNA damage response and cell cycle 
checkpoint proteins z (Huyton et al., Mutat Res 460:319-32, 2000). A series of 

15 deletion constructs was therefore generated and analyzed for phosphopeptide- 

specific binding (Figure 17B). A construct containing only the tandem 3 rd and 4 th 
BRCT domains showed strong and specific binding to the (pSer or pThr)-Gln 
library. Constructs of PTIP lacking both of these domains failed to bind or lacked 
phospho-discrimination. Furthermore, neither the 3 rd or 4 th BRCT domains alone 

20 bound to phosphopeptides, suggesting that the PTIP tandem C-terminal BRCT 
pair functions as a single module that is necessary and sufficient for phospho- 
specific binding. 

Tandem BRCT domains function as single unit to mediate phosphopeptide- 
25 binding 

BRCT domains are often found in tandem pairs, or multiple copies of 
tandem pairs. To investigate whether (pSer- or pThr)-binding is a general feature 
of these domains, we screened tandem BRCT pairs from a number of other DNA 
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damage proteins (Figure 18A). Like PTIP, the BRCA1 C-terminal BRCT 
domains also showed phospho-specific binding. Neither of the BRCA1 BRCT 
domains alone was sufficient for phospho-specific interactions, again suggesting 
that the tandem BRCT domains are functioning as a single unit. This observation 
5 is in excellent agreement with limited proteolysis and X-ray crystallography 
studies in which the tandem BRCA1 BRCT domains together with the inter- 
domain linker behave as a single stable fragment (Williams et al., Nat Struct Biol 
8:838-42, 2001). In contrast to PTIP and BRCA1, phospho-specific binding to the 
tandem BRCT domains of MDC1 or 53BP1 was not observed, and only a very 
10 low amount of phospho-specific binding for Rad9 was detected, suggesting that 
the phosphopeptide-binding function is present in only a subset of tandem BRCT 
domains. 



Identification of Optimal tandem BRCT domain-binding peptide 

1 5 Modular domains identified by binding to bead-immobilized 

phosphopeptide libraries are directly amenable to determination of their optimal 
binding motif by traditional peptide library screening (Yaffe et al., Methods 
Enzymol 328:157-70, 2000; Elia et al., Science 299:1228-31, 2003). We 
determined the optimal pSer/pThr binding motifs for the tandem C-terminal 

20 BRCTs in PTIP and BRCA1 using (pSer or pThr)-Gln, pSer- and pThr-containing 
peptide libraries (Figure 18B and 18C, Table 4). 
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Table 6 Phosphoserine and phosphothreonine peptide motif selection by PTIP 
and BRCA1 Tandem BRCT motifs 



Phosphoserine and Phosphothreonine Peptide 

Motif Selection bv PTIP and BRCA1 Tandem BRCT Domains 



PTIP 



-4 -3 


-2 


-1 




+1 


+2 


+3 


+4 


+5 


X Y(1.5) 


G (2 3) 


L (2 6) 


pS/pT 


S2 


_v 


c 

Li 


p h 6) 


I (2 9) 




D(1 .5) 
E(1.4) 


I (2.5) 
M (2.5) 
V(1.9) 






(3.8) 

I (2.8) 


L 

(4.3) 
1(4.1) 




F(2.7) 
L(2.4) 

V (2.0) 

Y (2.0) 


X X 


E(1.3) 


1(1.4) 


pS 


F(1.7) 


V(1.8) 


F 


X 


1(1.9) 






M(1.4) 


1(1.5) 


T(1.5) 






F(1.7) 






V(1.4) 




Q(1.5) 








M (1.6) 






L(1.3) 




Y(1.3) 








L(1.4) 


G(1.6) Y(1.1) 


D(1.2) 


L(1.2) 


pS 


Q(1.3) 


V(2.1) 


F (2.3) 


P(1.2) 


Y(1.3) 




E(1.1) 


1(1.2) 
M(1.2) 


1(1.3) 
P(1.2) 


1(1.7) 


I (2.3) 
V(1.8) 
L(1.7) 
Y(1.5) 






X X 


X 


1(2.1) 


PT 


Q(1.5) 


Y(1.4) 


1(1.4) 


F(1.5) 


A 






L(1.8) 


F(1.4) 




L(1.3) 


Y(1.4) 








W 




1(1.3) 




V(1.2) 


P(1.3) 








(1-3) 
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BRCA1 



-4 


-3 


-2 


-1 




+1 


+2 


+3 


+4 


+5 


X 


F(1.7) 


D(1.2) 


1(1.4) 


dS/dT 


O 


y 


F 


V(1.5) 


f 




Y(1.6) 


E (1.1) 


V(1.3) 






{3/Q 


/— 7 r- \ 


P(1.4) 










L(1.2) 














M(1.2) 






T (2.6) 


Y 




G(1.8) 














1 (2.2) 


(5.2) 


















S(1.7) 






X 


R(1.5) 


E(1.3) 


V(1.4) 


pS 


F(2.1) 


T(1.9) 


F 


X 


F(1.6) 




Yd .4) 


D(1.2) 


1(1.3) 


Y(1.6) 


V(1.7) 






M (1.4) 








M(1.3) 




1(1.4) 








Y(1.3) 












Q(1.4) 










X 


X 


Y(1.2) 


X 


PS 


0(1-4) 


V(1.2) 


F (2.4) 


1(1.2) 


X 










F(1.3) 


1(1.2) 


Y(1.5) 






X 


E(1.5) 


D(1.9) 


1(1.6) 


PT 


0(1.5) 


D(1.5) 


F(1.9) 


D(1.4) 


A 






E(1.5) 


L(1.4) 


E(1.4) 


Y(1.3) 


Y(1.2) 


P(1.2) 





F(1.3) 1(1.2) 



A GST fusion of the PTIP or BRCA1 tandem BRCT domains was screened for binding to four 
phosphopeptide libraries, which contained the sequences GAXXXB(pS/pT)QJXXXAKKK, 
GAXXXXpSXXFXXAYKKK, MAXXXXpTXXXXAKKK, and MAXXXXSpXXXXXAKKK, where X indicates all 
5 amino acids except Cys. In the library MAXXXB(pS/pT)QJXXXAKKK B indicates A, I, L, M, N, P, S, T, V, 
and J represents a biased mixture of 25% E, 75% X, while X indicates all amino acids except Arg, Cys, His, 
Lys for all positions in this library. Residues showing strong enrichment are underlined. 

Table 6 shows the results of a phosphoserine and phosphothreonine motif 
10 selection by PTIP and BRCA1 tandem BRCT domains. A GST fusion of the PTIP 
or BRCA1 tandem BRCT domains was screened for binding to three 
phosphopeptide libraries, which contained the sequences 
MAXXXB(pS/pT)QJXXXAKKK SEQ ID NO:53, MAXXXXpTXXXXAKKK 
SEQ ID NO:54, and MAXXXXSpXXXXXAKKK SEQ ID NO:55; where X 
15 indicates all amino acids except Cys. In the libraries 
MAXXXB(pS/pT)QJXXXAKKK (SEQ ID NO:56) and 
GAXXXXpSXXFXXAYKKK, B indicates A, I, L, M, N, P, S, T, V; and J 
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represents a biased mixture of 25% E 3 75% X, while X indicates all amino acids 
except Arg 5 Cys, His, Lys. Residues showing very strong enrichment (ratio >3) 
are underlined. 

PTIP and BRCA1 BRCTs displayed similar, but not identical motifs, with 
5 extremely strong selection for aromatic/aliphatic residues, and aromatic residues, 
respectively, in the (pSer or pThr)+3 position when screened with a (pSer or 
pThr)-Gln library. Prominent amino acid selection was also observed in the (pSer 
or pThr)+2 and +5 positions, in addition to more moderate selection at other 
positions. Because the BRCT domains were isolated in a screen for domains that 

10 bind to (pSer or pThr)-Gln motifs, we investigated the relative importance of Gin 
in the (pSer or pThr)+l position using individual pThr- or pSer-oriented peptide 
libraries. This analysis revealed modest selection for Gin in the degenerate +1 
position. Furthermore, the absence of a fixed Gin in the +1 position reduced the 
selection for aromatic and aliphatic residues in the +3 and +5 positions, suggesting 

15 that while Gin in the (pSer or pThr)+l position was not essential, it was clearly a 
favored residue. In agreement with this finding, we observed considerably 
stronger binding of the tandem BRCT domains to bead-immobilized (pSer or 
pThr)-Gln libraries than to libraries containing only a fixed pSer motif (Figure 
18A). 

20 On the basis of peptide library data, we defined an optimal tandem BRCT 

domain-binding peptide as Y-D-I-(pSer or pThr)-Q-V-F-P-F. Isothermal titration 
calorimetry (ITC) showed that the optimal phosphoserine-containing peptide 
bound to the tandem C-terminal BRCTs of PTIP with a dissociation constant of 
280 nM, and to the BRCT domains of BRCA1 with a dissociation constant of 400 

25 nM (Table 7). 
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Table 7 Peptide binding affinities for the tandem BRCT domains 



Table S2. Peptide Binding Affinities for the Tandem BRCT Domains 



Peptide 


Sequence 


(BRCTh Domain 


Kn 


BRCTtide-7pS 


GAAYDI-pS-OVFPFAKKK 


PTIP 


280 nM 


BRCTtide-7pT 


GAAYDl-pT-QVFPFAKKK 


PTIP 


14.3MM 


BRCTtide-7S 


GAAYDk S-OVFPFAKKK 


PTIP 


N.D.B. 


BRCTtfcJe«7T 


GAAYDI- T-QVFPFAKKK 


PTIP 


N.D.B. 


BRCTtide-7pS 


GAAYDI~pS-QVFPFAKKK 


BRCA1 


400 nm 


BRCTtide.7S 


GAAYDI S-QVFPFAKKK 


8RCA1 


N.D.B. 


BRCTlftte-TT 


GAAYDI- T-QVFPFAKKK 


8RCA1 


N.DB. 



Isothefmal titration calorimetry (ITC) was used to determine binding constants (K d ). AH observed binding 
stoichtometries were consaslenl with alt complex of protein and phosphopeptide. N.D.B indcatea no 
ctetectabte btndng by ITC for a tandem BRCT domain with a concentration of at least 1 50^M. pS and pT 
denote phosphoearine and phosphothreo nine , respectively. 

PTIP and BRCA1 tandem BRCT domains were purified as GST-fusion 
proteins from E. coli and binding to individual peptides measured by isothermal 
5 titration calorimetry. Binding stoichiometries were consistent with a 1:1 complex 
of protein and phosphopeptide. Replacement of pThr for pSer reduced the affinity 
of the peptide for the PTIP BRCT domains, while substitution of Thr for pThr 
abrogated binding altogether. 

To further verify motif selection, binding of the tandem BRCT domains to 

10 a solid-phase array of immobilized phosphopeptides was performed in which each 
amino acid flanking the pThr-Gln core (Figure 18D and 18E) or flanking the pSer 
(Figures 18F and 18G) in the optimal BRCTtide was varied. The resulting 
selectivities were generally consistent with the results obtained using oriented 
peptide libraries in solution. Substitution of pSer for pThr significantly enhanced 

15 binding for both PTIP and BRCA1, consistent with the ITC results for PTIP. 

Substitution of pTyr for pThr eliminated binding altogether, verifying that tandem 
BRCT domains are pSer/pThr-specific binding modules. As expected, 
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replacement of pThr with Thr, Ser or Tyr abrogated tandem BRCT domain 
binding. 

Tandem BRCT domain binding eliminated by pre-incubation with (pSer or 
5 pThr)-Gln peptide library 

To examine the role of tandem BRCT domains in binding to 
ATM/ATR/ATX-phosphorylated proteins after DNA damage, U20S cell lysates, 
prior to and following 10 Gy of y-irradiation, were incubated with GST-(BRCT) 2 
fusion proteins and blotted with an anti-(pSer or pThr)-Gln motif antibody raised 

10 against the phosphorylation motif generated by ATM and ATR (Cell Signaling 
Technologies) (Figures 19A-19D). Following y-irradiation, both PTIP and 
BRCA1 tandem C-terminal BRCTs bound to numerous proteins recognized by the 
anti-ATM/ ATR phosphoepitope motif antibody (Figure 19A). This interaction 
could be inhibited by pre-incubating the tandem BRCT domains with a (pSer or 

15 pThr)-Gln peptide library, but not with a pThr-Pro library or with the non- 

phosphorylated (Ser or Thr)-Gln library. A time course analysis revealed optimal 
binding of both the PTIP and BRCA1 BRCT domains to (pSer or pThr)-Gln- 
containing proteins in irradiated cell lysates at 0.5 and 2 hours after DNA damage 
(Figure 19B and 19D). Binding was largely eliminated by the optimal BRCTtide 

20 (opt), but not by its non-phosphorylated analogue (7T), or by pre-treatment of the 
cells with caffeine to inhibit ATM and ATR prior to y-irradiation. In both cases 
where the phospho-specific interaction was eliminated, we observed a -170 kDa 
immunoreactive band in the PTIP BRCT domain pulldowns, but not in the 
BRCA1 pulldowns; this band likely resulted from an interaction with the PTIP 

25 BRCT domains at a site distinct from its phosphopeptide-binding pocket. 
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Tandem C-terminal BRCT domains are necessary and sufficient for nuclear 
foci formation following DNA damage 

In response to y-irradiation, the DNA damage protein 53BP1 undergoes 
phosphorylation by ATM and facilitates the ability of ATM to phosphorylate 
5 additional cellular substrates (Schultz et al., J Cell Biol 151:1381 ,2000; 
Rappold et al, J Cell Biol 153:613-20, 2001; Anderson et al., Mol Cell Biol 
21:1719-29, 2001; Abraham, Nat Cell Biol 4:E277-9, 2002; Wang et al., Science 
298:1435-8, 2002; Fernandez-Capetillo et al., Nat Cell Biol 4:993-7, 2002; 
DiTullio, Jr. et al., Nat Cell Biol 4:998-1002, 2002). 53BP1 migrates at a similar 

10 Mr as one or more of the bands in Figure 19A and 19B and contains multiple 
potential Ser/Thr-Gln ATM/ATR phosphorylation sites that closely match the 
optimal PTIP tandem BRCT-binding motif. Endogenous 53BP1 from U20S cell 
lysates bound to the tandem C-terminal BRCT domains of PTIP only following 
DNA damage (Figure 19C). Similar to the results obtained with the (pSer or 

15 pThr)-Gln motif antibody, a time course of cells transfected with HA-tagged 

53BP1 revealed optimal binding at 0.5 and 2 hours following y-irradiation. This 
binding was inhibited by preincubation with optimal BRCTtide, but was not 
eliminated by pre-incubation with its non-phosphorylated counterpart. Binding 
was also eliminated by pre-incubation of the tandem BRCT domains with the 

20 (pSer or pThr)-Gln peptide library, but not by pre-incubation with a pThr-Pro 

library or the non-phosphorylated (Ser or Thr)-Gln library, as well as by treatment 
with caffeine prior to y-irradiation or treatment of the lysates with ^-phosphatase 
following irradiation. 

Although PTIP was originally identified as a transcriptional control protein, 

25 recent data suggests that PTIP might also be involved in DNA damage signaling 
(Cho et al., Mol Cell Biol 23:1666-73, 2003). Mice homozygous for a PTIP null 
allele undergo embryonic lethality at E9.5, with evidence of extensive DNA 
damage and the presence of free DNA ends. Neither fibroblasts nor embryonic 
stem cells from PTIP null mice could be propagated in culture, and trophoblast 
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cells, which showed decreased viability in general, showed an increased sensitivity 
to low doses of ionizing radiation (Cho et al., Mol Cell Biol 23:1666-73, 
2003). This data, together with our finding that the tandem BRCT domains at the 
C-terminus of PTIP bind to ATM/ATR phosphorylated proteins, suggested that 
5 full-length PTIP might localize at sites of DNA damage in vivo. 

To investigate this, U20S cells were transfected with GFP fusions of full- 
length PTIP, PTIP lacking the last two C-terminal BRCT domains, or the isolated 
tandem C-terminal BRCT domains alone (Figures 20A-20C). In the absence of 
irradiation, PTIP was diffusely nuclear with a small amount of cytosolic staining. 

10 Two hours following DNA damage, PTIP re-localized into discrete nuclear foci 
that significantly co-localized with ATM/ATR phosphoepitopes, 53BP1 and 
phospho-H2AX (Figure 20A). Deletion of the C-terminal BRCTs from PTIP 
resulted in its constitutive diffuse nuclear and cytoplasmic localization and an 
inability to form foci after DNA damage (Figure 18B). The isolated PTIP C- 

1 5 terminal tandem BRCT domains, while predominantly diffusely nuclear in the 
absence of DNA damage, efficiently re-localized into the same punctate nuclear 
foci after y-irradiation as full-length PTIP (Figure 18C). Thus, the tandem C- 
terminal BRCT domains of PTIP, which are necessary and sufficient for binding 
to (pSer or pThr)-Gln peptides in solution, are necessary and sufficient for nuclear 

20 foci formation by full-length PTIP following DNA damage. 

Caffeine attenuates recruitment of PTIP to DNA damage foci in response to 
ionizing radiation (Figures 21 A and 2 IB). U20S cells transfected with full-length 
PTIP-GFP cDNA were mock treated or pretreated with 1 OmM caffeine for 70 
minutes before exposure to lOGy ionizing radiation. In reponse to IR ionizing 

25 radiation, mock-treated U20S cells formed nuclear foci containing PTIP (in green) 
and H2AXp (in red); these two proteins co-localize at sites of DNA damage 
(merge). In response to IR, caffeine treated U20S cells formed reduced numbers 
of nuclear foci; PTIP was mislocalized and did not form discrete nuclear foci (in 
green) and there were reduced numbers of H2AXp (in red) containing foci. These 
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results demonstrate that pretreatment with caffeine effectively abolished co- 
localization of PTIP and H2AXp (merge). 

Our identification of tandem BRCT domains as a new pSer/pThr-binding 
module targeting ATM and ATR phosphorylation motifs expands the range of 
5 functions subserved by this domain in response to DNA damage signaling. Only 
tandem pairs were observed to function in this capacity, and only a subset of 
BRCT domains, including those in PTIP and BRCA1, appear to show phospho- 
specific binding. The important role for tandem BRCT domains as phospho- 
binding modules is emphasized by the finding that -80% of germline mutations in 

10 BRCA1 result in C -terminal truncations involving the BRCT region, predisposing 
women to breast and ovarian cancer (Huyton et al., Mutat Res 460:3 19-32, 2000). 
Interestingly, a BRCA1 cancer-associated mutation in the (BRCT)2 module that 
ablates critical BRCA1 protein interactions, Met 1 7753 Arg (M1775R), fails to bind 
phosphopeptides (Fig. 2 A), even though the M1775R crystal structure is nearly 

15 identical to that of the wild-type (BRCT)2 . The finding that BRCT domains bind 
to pSer-containing peptides more strongly than to pThr-containing peptides is 
novel since WW domains, 14-3-3 proteins, FHA domains and Polobox domains 
either bind pThr-peptides better than pSer peptides, or do not bind to pSer- 
peptides at all (Verdecia et al., Nat Struct Biol 7:639-43, 2000; Durocher et al., 

20 Mol Cell 6:1 169-1 182, 2000; Elia et al. Science 299:1228-31, 2003). 

Intriguingly, ATM and ATR preferentially phosphorylate Ser-Gln over Thr-Gln 
motifs (Kim et al, J Biol Chem 274:37538-43, 1999), suggesting functional 
convergence between the motifs generated by phosphoinositide-like kinases and 
the motifs recognized by BRCT domains. The observed BRCT domain selection 

25 for aromatic and aliphatic residues in the (pSer or pThr)+3 and +5 positions within 
their bound substrates exceeds their modest selection for Gin in the +1 position. 
Thus, only a subset of ATM/ ATR phosphorylated substrates are likely to bind 
with high affinity. Kinases other than Gin-directed kinases might also generate 
potential BRCT domain-binding motifs. In addition, the results of our screen 
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provide a molecular rationale for the early embryonic lethality of PTIP knock-out 
mice with extensive unrepaired DNA ends. The finding that the C-terminal ' 
tandem BRCT domains of PTIP bind to ATM/ATR-phosphorylated motifs and 
localize full-length PTIP to sites of DNA damage strongly suggests that PTIP 
functions as a key component of the DNA damage response. Interference with the 
normal process of DNA damage signaling is responsible not only for 
tumorigenesis but also for tumor cell death in the face of massive DNA damage 
induced by chemotherapeutic agents, depending on the remaining genetic 
background of the cancer cell (Scully et al., Nature 408:429-32, 2000). Agents 
that interfere with DNA damage signaling sensitize tumor cells to killing by 
radiation and chemotherapy. Thus, the phosphopeptide-binding pocket of tandem 
BRCT domains constitutes a promising target for anti-cancer drug development. \ 

ATM/ATR/ATX phospho-motif screen for phosphoserine/threonimie binding 
15 domains 

An oriented (pSer/pThr) phosphopeptide library biased toward the 
phosphorylation motifs for ATM/ATR kinases and its non-phosphorylated 
counterpart were constructed as follows: 

biotin-Z-G-Z-G-G-A-X-X-X-B-(pS/pT)-QJ-X-X-X-A-K-K-K SEQ ID NO:57and 
20 biotin-Z-G-Z-G-G-A-X-X-X-B-(S/T)-Q-J-X-X-X-A-K-K-K SEQ ID NO:58, 
where pS denotes phosphoserine; pT phosphothreonine; Z indicates 
aminohexanoic acid; B represents a biased mixture of the amino acids A, I, L, M, 
N, P, S, T, V; and J represents a biased mixture of 25% E and 75% X, where "X" 
denotes all amino acids except Arg, Cys, His, Lys. Streptavidin beads (Pierce, 
25 75pmol/jiL gel) were incubated with a ten-fold molar excess of each biotinylated 
library in 50 mM Tris/HCl (pH7.6), 150 mM NaCl, 0.5% NP-40, 1 mM EDTA, 2 
mM DTT and washed five times with the same buffer to remove unbound peptide. 
The bead-immobilized libraries (IOjiL of gel) were added to 10 jiL of an in vitro 
translated [ 35 S]-labeled protein pool in 150 jaL binding buffer (50 mM Tris/HCl 
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(pH7.6), 1 50 mM NaCl, 0.5% NP-40, 1 mM EDTA, 2 mM DTT, 8 ^ig/mL 
pepstatin, 8 |ig/mL aprotinin, 8 |ig/mL leupeptin, 800 ^iM Na3V04, 25 mM NaF). 
Each pool consisted of -100 radiolabeled proteins produced by the 
PROTEOLINK in vitro expression cloning system (Promega, Madison, WI). 
5 After incubation at 4°C for 3 hours, the beads were rapidly washed three times 200 
with binding buffer prior to SDS-PAGE (12.5%) and autoradiography. 
Positively scoring hits were identified as protein bands that interacted more 
strongly with the phosphorylated immobilized library than with the 
unphosphorylated counterpart. Pools containing positively scoring clones were 
10 progressively subdivided and re-screened for phosphobinding until single clones 
were isolated and identified by DNA sequencing. 

Clooing, expression, and purification of PTIP and BRCA1 

For deletion mapping of the PTIP and BRCA1 BRCT phospho-binding 

15 region and for expression of MDC1, 53BP1 and Rad9 (Figure 17-18), fragments 
were generated by PCR for in vitro transcription/translation and cloned into a 
pCDNA3.1 expression vector (Invitrogen, San Diego, California). For production 
of recombinant GST-PTIP BRCT domains and GSTBRCA1 BRCT domains, 
residues 550-757 of PTIP and residues 1634-1863 of BRCA1 were ligated into the 

20 EcoRI and Notl sites of pGEX-4Tl (Pharmacia, Peapack, NJ) and subsequently 
transformed into DH5a E. Coli. Protein induction occurred at 37°C for 4 hours or 
at 25°C for 16 hours in the presence of 0.4 mM IPTG. For peptide filter blot 
analysis and measurements of peptide binding affinity by ITC, GSTPTIP BRCT 
domains (residues 550-757) and GST-BRCA1 BRCT domains (residues 1634- 

25 1863) were isolated from bacterial lysates using glutathione agarose, eluted with 
40mM glutathione, and dialyzed into 50mM Tris/HCl (pH 8.1), 300mM NaCl. 
The GFP-PTIP constructs FL (residues 1-757), !BRCT (residues 1-550), or 
(BRCT)2 (residues 550-757) were cloned into the EcoRI and Sail sites of the 
pEGFP-C2 (BD Biosciences Clontech Franklin Lakes, NJ) expression vector. 
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Peptide library screening 

Phosphoserine and phosphothreonine oriented degenerate peptide libraries 
consisting of the sequences 
5 Gly-Ala-X-X-X-B-(pSer/pThr)-Gln-J-X-X-X-Ala-Lys-Lys-Lys SEQ ID NO:59, 
Met-Ala-X-X-X-X-pThr-X-X-X-X-Ala-Lys-Lys-Lys SEQ ID NO:60, 
and Met-Ala-X-X-X-XpSer-X-X-X-X-X-Ala-Lys-Lys-Lys SEQ ID NO:61; where 
pS is phosphoserine, pT is phosphothreonine; and X denotes all amino acids 
except Cys. In the (pSer/pThr)-Gln library, B is a biased mixture of the amino 

10 acids A 5 1, L, M, N, P, S, T, V, and J represents a biased mixture of 25% E, 75% 
X 5 where X denotes all amino acids except Arg, Cys, His, Lys. Peptides were 
synthesized using N-a-FMOC-protected amino acids and standard BOP/HOBt 
coupling chemistry. Peptide library screening was performed using 125 jixl of 
glutathione beads containing saturating amounts of GST-PTIP BRCT or GST- 

15 BRCA1 BRCT domains (1-1.5 mg) as described by Yaffe and Cantley {Methods 
Enzymol 328:157-70, 2000). Beads were packed in a lmL column and incubated 
with 0.45 mg of the peptide library mixture for 10 minutes at room temperature in 
PBS (150 mM NaCl, 3 mM KC1, 10 mM Na2HP04, 2 mm KH2P04, pH 7.6). 
Unbound peptides were removed from the column by two washes with PBS 

20 containing 1 .0% NP-40 followed by two washes with PBS. Bound peptides were 
eluted with 30% acetic acid for 10 minutes at room temperature, lyophilized, 
resuspended in H20, and sequenced by automated Edman degradation on a 
PROCISE protein microsequencer (Perkin-Elmer Corporation, Norwalk CT). 
Selectivity values for each amino acid were determined by comparing the relative 

25 abundance (mole percentage) of each amino acid at a particular sequencing cycle 
in the recovered peptides to that of each amino acid in the original peptide library 
mixture at the same position. 
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Isothermal Titration Calorimetry 

Peptides were synthesized by solid phase technique with three C-terminal 
lysines to enhance solubility. The peptides were then purified by reverse phase 
HPLC following deprotection and confirmed by MALDI-TOF mass spectrometry. 
5 Calorimetry measurements were performed using a VP-ITC microcalorimeter 
(MicroCal Inc., Studio City, CA). Experiments involved serial IOjiL injections of 
peptide solutions (20)iiM-150^iM) into a sample cell containing 15jiM GST-PTIP 
BRCT domains (residues 550-757) or \5\xM GST-BRCA1 BRCT domains 
(residues 1634-1863) in 50mM Tris/HCl (pH 8.1), 300mM NaCl. Twenty 
10 injections were performed with 240 second intervals between injections and a 

reference power of 25 |uCal/s. Binding isotherms were plotted and analyzed using 
ORIGIN Software (MicroCal 
Inc. Studio City, CA). 

1 5 Peptide Filter Array 

An ABIMED peptide arrayer with a computer controlled Gilson diluter and 
liquid handling robot (Abimed GmbH, Dusseldorf, Germany) was used to 
synthesize peptides onto an amino-PEG cellulose membrane using N-a-FMOC- 
protected amino acids and DIC/HOBT coupling 

20 chemistry. The membranes were blocked in 5% milk/TBS-T (0.1%) for lhour at 
room temperature, incubated with 0.05 |iM GST-PTIP BRCT domains (residues 
550-757) -or GST-BRCA1 BRCT domains (residues 1634-1863) in 5% milk, 50 
mM Tris/HCl (pH 7.6), 150 mM NaCl, 2 mM EDTA, 2mM DTT for 1 hour at 
room temperature and washed four times with TBS-T (0.1%). The membranes 

25 were then incubated with anti-GST conjugated HRP (Amersham) in 5% 

milk/TBS-T (0.1%) for 1 hour at room temperature, washed five times with TBS- 
T (0.1%), and subjected to chemiluminescence. 
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PTIP BRCT domains and BRCA1 BRCT domains binding to cellular 
substrates 

U20S cells were either treated with 10 Gy of ionizing radiation or mock 
irradiated and allowed to recover for 30-120 minutes. Cells were subsequently 
5 lysed in 50 mM Tris/HCl (pH7.6), 150 mM NaCl, 1 .0% NP-40, 5 mM EDTA, 2 
mM DTT, 8 |iig/mL pepstatin, 8 |ig/mL aprotinin, 8 )ig/mL leupeptin, 2 mM 
Na3V04, 10 mM NaF, 1 ^M microcystin. The lysates (0.5-2mg) were incubated 
with 20 nL glutathione beads containing 10-20 \ig of GST-PTIP BRCT domains 
(residues 550-757), GST-BRCA1 BRCT domains (residues 1634-1863), or GST 

10 for 120 minutes at 4°C. Beads were washed three times with lysis buffer. 

Precipitated proteins were eluted in sample buffer and detected by blotting with 
anti-ATM/ATR substrate (pSer/pThr)Gln antibody (CELL SIGNALING 
TECHNOLOGY, Inc Beverly, MA), polyclonal anti-53BPl (ONCOGENE 
RESEARCH PRODUCTS, San Diego, California 92121), or monoclonal anti-HA 

15 (COVANCE Inc, Princeton, NJ). For peptide competition experiments, GST- 
PTIP BRCT domains or GST-BRCA1 BRCT domains were immobilized on 
glutathionine beads and preincubated with 350 ^M of BRCTtide-optimal, 7pT, 7T, 
pSQ-library, SQ-library, or pTP-library for 1 hour at 4°C and washed three times 
with lysis buffer. 

20 

Immunofluorescence and Microscopy 

U20S cells were seeded onto 18mm2 coverslips and transfected with GFP- 
PTIP constructs FL (residues 1-757), !BRCT (residues 1-550), or (BRCT)2 
(residues 550-757) using FUGENE6 transfection reagent (Roche, Basel, 
25 Switzerland) according to manufacture's protocol. Twenty-four hours following 
transfection, the cells were either treated with 10 Gy of ionizing radiation or mock 
irradiated and allowed to recover for 120 minutes. Cells were fixed in 3% 
paraformaldehyde/2% sucrose for 15 minutes at room temperature and extracted 
with a 0.5% Triton X-100 solution containing 20 mM Tris-HCl (pH 7.8), 75 mM 
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NaCl, 300 mM sucrose, and 3 mM MgC12 for 15 minutes at room temperature. 
Slides were stained with primary antibodies at 4°C overnight, then stained with a 
Texas Red conjugated anti-mouse or anti-rabbit secondary antibody for 60 minutes 
(Molecular Probes, Eugene, OR) at room temperature. Primary antibodies used 
5 were rabbit anti-53BPl (Oncogene Research Products, San Diego, California), 
mouse anti-g-H2AX (Upstate, Charlottesville, VA), and rabbit anti-(pS/pT)Q (Cell 
Signaling Technology, Inc., Beverly, MA). Images were collected on a 
Deltavision microscope (Carl Zeiss, Thornwood, NY) and digitally deconvolved 
using SOFTWORX graphics processing software (SGI, CSIF, Stanford, CA). 

10 

Peptidoimiiimetics 

Peptide derivatives (e.g. peptidomimetics) include cyclic peptides, peptides 
obtained by substitution of a natural amino acid residue by the corresponding D- 
stereoisomer, or by a unnatural amino acid residue, chemical derivatives of the 

15 peptides, dual peptides, multimers of the peptides, and peptides fused to other 

proteins or carriers. A cyclic derivative of a peptide of the invention is one having 
two or more additional amino acid residues suitable for cyclization. These 
residues are often added at the carboxyl terminus and at the amino terminus. A 
peptide derivative may have one or more amino acid residues replaced by the 

20 corresponding D-amino acid residue. In one example, a peptide or peptide 
derivative of the invention is all-L, all-D, or a mixed D,L-peptide. In another 
example, an amino acid residue is replaced by a unnatural amino acid residue. 
Examples of unnatural or derivatized unnatural amino acids include Na-methyl 
amino acids, Coc -methyl amino acids, and p-methyl amino acids. 

25 A chemical derivative of a peptide of the invention includes, but is not 

limited to, a derivative containing additional chemical moieties not normally a part 
of the peptide. Examples of such derivatives include: (a) N-acyl derivatives of the 
amino terminal or of another free amino group, where the acyl group may be 
either an alkanoyl group, e.g., acetyl, hexanoyl, octanoyl, an aroyl group, e.g., 
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benzoyl, or a blocking group such as Fmoc (fluorenylmethyl-O-CO-), 
carbobenzoxy (benzyl-O— CO--), monomethoxysuccinyl, naphthyl-NH— CO— , 
acetylamino-caproyl, adamantyl-NH~CO-; (b) esters of the carboxyl terminal or 
of another free carboxyl or hydroxy groups; (c) amides of the carboxyl terminal or 
5 of another free carboxyl groups produced by reaction with ammonia or with a 
suitable amine; (d) glycosylated derivatives; (e) phosphorylated derivatives; (f) 
derivatives conjugated to lipophilic moieties, e.g., caproyl, lauryl, stearoyl; and (g) 
derivatives conjugated to an antibody or other biological ligand. Also included 
among the chemical derivatives are those derivatives obtained by modification of 
10 the peptide bond --CO-NH--, for example, by: (a) reduction to -CH 2 — NH— ; (b) 
alkylation to -CO--N(alkyl)--; and (c) inversion to -NH— CO-. 

A dual peptide of the invention consists of two of the same, or two 
different, peptides of the invention covalently linked to one another, either directly 
or through a spacer. 

15 Multimers of the invention consist of polymer molecules formed from a 

number of the same or different peptides or derivatives thereof. 

In one example, a peptide derivative is more resistant to proteolytic 
degradation than the corresponding non-derivatized peptide. For example, a 
peptide derivative having D-amino acid substitution(s) in place of one or more L- 
20 amino acid residue(s) resists proteolytic cleavage. 

In another example, the peptide derivative has increased permeability 
across a cell membrane as compared to the corresponding non-derivatized peptide. 
For example, a peptide derivative may have a lipophilic moiety coupled at the 
amino terminus and/or carboxyl terminus and/or an internal site. Such derivatives 
25 are highly preferred when targeting intracellular protein-protein interactions, 
provided they retain the desired functional activity. 

In another example, a peptide derivative binds with increased affinity to a 
ligand (e.g., a Polo box domain). 
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I 
I 

I 

The peptides or peptide derivatives of the invention are obtained by any 

I 

method of peptide synthesis known to those skilled in the art, including synthetic ! 
and recombinant techniques. For example, the peptides or peptide derivatives can 
be obtained by solid phase peptide synthesis which, in brief, consists of coupling 
5 the carboxyl group of the C-terminal amino acid to a resin and successively adding 
N-alpha protected amino acids. The protecting groups may be any such groups 
known in the art. Before each new amino acid is added to the growing chain, the 
protecting group of the previous amino acid added to the chain is removed. The 
coupling of amino acids to appropriate resins has been described by Rivier et al. 

10 (U.S. Pat. No. 4,244,946). Such solid phase syntheses have been described, for 
example, by Merrifield, J. Am. Chem. Soc. 85:2149, 1964; Vale et al., Science 
213:1394-1397, 1984; Marki et *L,J.Am. Chem. Soc. 10:3178, 1981, and in U.S. 
Pat. Nos. 4,305,872 and 4,316,891. In a preferred aspect, an automated peptide 
synthesizer is employed. 

15 Purification of the synthesized peptides or peptide derivatives is carried out 

by standard methods, including chromatography (e.g., ion exchange, affinity, and 
sizing column chromatography), centrifugation, differential solubility, 
hydrophobicity, or by any other standard technique for the purification of proteins. 
In one embodiment, thin layer chromatography is employed. In another 

20 embodiment, reverse phase HPLC (high performance liquid chromatography) is 
employed. 

Finally, structure-function relationships determined from the peptides, 
peptide derivatives, and other small molecules of the invention may also be used 
to prepare analogous molecular structures having similar properties. Thus, the 
25 invention is contemplated to include molecules in addition to those expressly 

disclosed that share the structure, hydrophobicity, charge characteristics and side 
chain properties of the specific embodiments exemplified herein. 

In one example, such derivatives or analogs that have the desired binding 
activity can be used for binding to a molecule or other target of interest, such as 
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any Polo-box domain. Derivatives or analogs that retain, or alternatively lack or 
inhibit, a desired property-of-interest (e.g., inhibit PBD binding to a natural 
ligand), can be used to inhibit the biological activity of a Polo-like kinase (e.g., 
Plk-1,2, or 3). 

5 In particular, peptide derivatives are made by altering amino acid sequences 

by substitutions, additions, or deletions that provide for functionally equivalent 
molecules, or for functionally enhanced or diminished molecules, as desired. Due 
to the degeneracy of the genetic code, other nucleic acid sequences that encode 
substantially the same amino acid sequence may be used for the production of 

10 recombinant peptides. These include, but are not limited to, nucleotide sequences 
comprising all or portions of a peptide of the invention that is altered by the 
substitution of different codons that encode a functionally equivalent amino acid 
residue within the sequence, thus producing a silent change. 

The derivatives and analogs of the invention can be produced by various 

1 5 methods known in the art. The manipulations that result in their production can 
occur at the gene or protein level. For example, a cloned nucleic acid sequence 
can be modified by any of numerous strategies known in the art (Sambrook et al., 
1989, Molecular Cloning, A Laboratory Manual, 2d ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y.). The sequence can be cleaved at 

20 appropriate sites with restriction endonuclease(s), followed by further enzymatic 
modification if desired, isolated, and ligated in vitro. 

Modified Phosphopeptides 

A phosphopeptide of the invention may include, but it is not limited to, an 
25 unnatural N-terminal amino acid of the formula (III): 
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R 1 




(HI) , 

where A 1 is an amino acid or peptide chain linked via an a-amino group; R 1 and 
R 3 are independently hydrogen, Cj. 5 branched or linear Cj. 5 alkyl, C]. 5 alkaryl, 
heteroaryl, and aryl, each of which are unsubstituted or substituted with a 
5 substituted selected from: 1 to 3 of C]. 5 alkyl, 1 to 3 of halogen, 1 to 2 of -OR 5 , 
N(R 5 )(R 6 ), SR 5 , N-C(NR 5 )NR 6 R 7 , methylenedioxy, -S(0) m R 5 , 1 to 2 of -CF 3 , - 
OCF 33 nitro, -N(R 5 )C(0)(R 6 ), -C(0)OR 5 3 -C(0)N(R 5 )(R 6 ), -lH-tetrazol-5-yl, - 
S0 2 N(R 5 )(R 6 ), -N(R 5 )S0 2 aryl, or -N(R 5 )S0 2 R 6 ; R 5 , R 6 and R 7 are independently 
selected from hydrogen, Q.5 linear or branched alkyl, Ci_ 5 alkaryl, aryl, heteroaryl, 
10 and C3.7 cycloalkyl, and where two C]_ 5 alkyl groups are present on one atom, they 
optionally are joined to form a C 3 _ 8 cyclic ring, optionally including oxygen, sulfur 
or NR 7 , where R 7 is hydrogen, or Q.5 alkyl, optionally substituted by hydroxyl; R 2 
is hydrogen, F, Ci_ 5 linear or branched alkyl, Ci_ 5 alkaryl; or R and R are joined 

to form a C3.8 cyclic ring, optionally including oxygen, sulfur, or NR , where R is 

2 3 

15 hydrogen, or Ci_ 5 alkyl, optionally substituted by hydroxyl, or R and R are joined 

to form a C3.8 cyclic ring, optionally substituted by hydroxyl and optionally 

77 7 
including oxygen, sulfur or NR , where R is hydrogen, or C]_ 5 alkyl; R is 

hydrogen, F, Q.5 linear or branched alkyl, C]. 5 alkaryl; and R 4 is hydrogen, Ci_ 5 

branched or linear Q.5 alkyl, Cj.5 alkaryl, heteroaryl, and aryl, each of which are 

20 unsubstituted or substituted with a substitutent selected from: 1 to 3 of C|. 5 alkyl, 1 
to 3 of halogen, 1 to 2 of -OR 5 , N(R 5 )(R 6 ), N-C(NR 5 )NR 6 R 7 , methylenedioxy, - 
S(0) m R 5 (where m is 0-2), 1 to 2 of -CF 3 , -OCF 3 , nitro, -N(R 5 )C(0)(R 6 ), - 
N(R 5 )C(0)(OR 6 ), -C(0)OR 5 , -C(0)N(R 5 )(R 6 ), -lH-tetrazol-5-yl, -S0 2 N(R 5 )(R 6 ), - 
N(R 5 )S0 2 aryl, or -N(R 5 )S0 2 R 6 , R 5 , R 6 and R 7 are independently selected from 

25 hydrogen, C|. 5 linear or branched alkyl, Q.5 alkaryl, aryl, heteroaryl, and C 3 . 7 
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cycloalkyl, and where two C\.$ alkyl groups are present on one atom, they 
optionally are joined to form a C 3 . 8 cyclic ring, optionally including oxygen, sulfur 
or NR 7 , where R 7 is hydrogen, or Ci_ 5 alkyl, optionally substituted by hydroxyl. 
The phosphopeptides of the invention may also include an internal 
5 unnatural internal amino acid of the formula: 



an amino acid or peptide chain linked via an oc-amino group; R 1 and R 3 are 
independently hydrogen, Ci_ 5 branched or linear d_ 5 alkyl, C]. 5 alkaryl, heteroaryl, 

10 and aryl, each of which are unsubstituted or substituted with a substitutent selected 
from: 1 to 3 of C,_ 5 alkyl, 1 to 3 of halogen, 1 to 2 of -OR 5 , N(R 5 )(R 6 ), SR 5 , N- 
C(NR 5 )NR 6 R 7 , methylenedioxy, -S(0) m R 5 (m is 1-2), 1 to 2 of -CF 3 , -OCF 3 , nitro, 
-N(R 5 )C(0)(R 6 ), -C(0)OR 5 , -C(0)N(R 5 )(R 6 ), -lH-tetrazol-5-yl, -S0 2 N(R 5 )(R 6 ), - 
N(R 5 )S0 2 aryl, or -N(R 5 )S0 2 R 6 ; R 5 , R 6 and R 7 are independently selected from 

15 hydrogen, C^ 5 linear or branched alkyl, Ci_ 5 alkaryl, aryl, heteroaryl, and C 3 _ 7 
cycloalkyl, and where two C\. 5 alkyl groups are present on one atom, they 
optionally are joined to form a C 3 . 8 cyclic ring, optionally including oxygen, sulfur 

or NR , where R is hydrogen, or C]_ 5 alkyl, optionally substituted by hydroxyl; 

2 21 
and R is hydrogen, F, Q.5 linear or branched alkyl, Ci_ 5 alkaryl; or R and R are 

20 joined to form a C 3 _ 8 cyclic ring, optionally including oxygen, sulfur or NR 7 , 

where R 7 is hydrogen, or C i_ 5 alkyl, optionally substituted by hydroxyl, or R 2 and 

R are joined to form a C 3 _ 8 cyclic ring, optionally substituted by hydroxyl and 

77 

optionally including oxygen, sulfur or NR , where R is hydrogen, or Ci_ 5 alkyl. 



25 invention, wherein an internal unnatural internal amino acid of the formula: 





The invention also includes modifications of the phosphopeptides of the 
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is present, where A is an amino acid or peptide chain linked via an a-carboxy 
group; A 1 is an amino acid or peptide chain linked via an ot-amino group; R 1 and 
R are independently hydrogen, C]_ 5 branched or linear C]. 5 alkyl, and Ci_ 5 alkaryl; 
5 R 2 is hydrogen, F, C\. 5 linear or branched alkyl, d. 5 alkaryl; or R 2 and R 1 are 
joined to form a C 3 . 8 cyclic ring, optionally including oxygen, sulfur or NR 7 , 
where R 7 is hydrogen, or C\. 5 alkyl, optionally substituted by hydroxyl; X is O or 
S; and R 5 and R 6 are independently selected from hydrogen, Ci_ 5 linear or 
branched alkyl, Ci. 5 alkaryl, aryl, heteroaryl, and C 3 . 7 cycloalkyl, and where two 
10 Cj. 5 alkyl groups are present on one atom, they optionally are joined to form a C 3 _ 8 
cyclic ring, optionally including oxygen, sulfur or NR 7 , where R 7 is hydrogen, or 

C1.5 alkyl, optionally substituted by hydroxyl; or R 5 and R 6 are joined to form a 

7 7 

C3-8 cyclic ring, optionally including oxygen, sulfur or NR , where R is hydrogen, 
or C 1.5 alkyl, optionally substituted by hydroxyl. 
15 The phosphopeptides of the invention may also include a C-terminal 

unnatural internal amino acid of the formula: 



R 1 




Q 
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where A 2 is an amino acid or peptide chain linked via an a-carboxy group; R 1 and 
R 3 are independently hydrogen, Cm branched or linear Cm alkyl, Cm alkaryl, 
heteroaryl, and aryl, each of which are unsubstituted or substituted with a 
substituted selected from: 1 to 3 of Cm alkyl, 1 to 3 of halogen, 1 to 2 of -OR 5 , 
5 N(R 5 )(R 6 ), SR 5 , N-C(NR 5 )NR 6 R 7 , methylenedioxy, -S(0) m R 5 , 1 to 2 of -CF 3 , - 
OCF3, nitro, -N(R 5 )C(0)(R 6 ), -C(0)OR 5 , -C(0)N(R 5 )(R 6 ), -lH-tetrazol-5-yl, - 
S0 2 N(R 5 )(R 6 ), -N(R 5 )S0 2 aryl, or -N(R 5 )S0 2 R 6 ; R 5 , R 6 and R 7 are independently 
selected from hydrogen, Ci_ 5 linear or branched alkyl, Ci_ 5 alkaryl, aryl, heteroaryl, 
and C3.7 cycloalkyl, and where two Cm alkyl groups are present on one atom, they 

10 optionally are joined to form a C 3 _ 8 cyclic ring, optionally including oxygen, sulfur 
or NR 7 ,. where R 7 is hydrogen, or Ci_ 5 alkyl, optionally substituted by hydroxyl; R 2 
is hydrogen, F, C1.5 linear or branched alkyl, Cm alkaryl; or R 2 and R 1 are joined 
to form a C 3 _ 8 cyclic ring, optionally including oxygen, sulfur or NR 7 , where R 7 is 
hydrogen, or Cm alkyl, optionally substituted by hydroxyl; or R 2 and R 3 are joined 

15 to form a C 3 _ 8 cyclic ring, optionally substituted by hydroxyl and optionally 
including oxygen, sulfur or NR 7 , where R 7 is hydrogen, or Q_ 5 alkyl; R 2 is 
hydrogen, F, Cm linear or branched alkyl, Cm alkaryl; and Q is OH , OR 5 , or 
NR 5 R 6 , where R 5 , R 6 are independently selected from hydrogen, Cm linear or 
branched alkyl, C M alkaryl, aryl, heteroaryl, and C 3 _ 7 cycloalkyl, and where two 

20 Cm alkyl groups are present on one atom, they optionally are joined to form a C 3 . 8 
cyclic ring, optionally including oxygen, sulfur or NR 7 , where R 7 is hydrogen, or 
Cm alkyl, optionally substituted by hydroxyl. Methods well known in the art for 
modifying peptides are found, for example, in " Remington: The Science and 
Practice of Pharmacy " (20th ed., ed. A.R. Gennaro, 2000, Lippincott Williams & 

25 Wilkins, Philadelphia). 



-193- 



Therapeutic Uses 

Peptide synthesis and conjugation 

Phosphopeptides of the invention are prepared as detailed above. 
Alternatively, phosphopeptides can be prepared using standard FMOC chemistry 
5 on 2-chlorotrityl chloride resin (Int. J. Pept. Prot. Res. 38, 1991, 555-61). 

Cleavage from the resin is performed using 20% acetic acid in dichloromehane 
(DCM), which leaves the side chain still blocked. Free terminal carboxylate 
peptide is then coupled to 4'(aminomethy)-fluorescein (Molecular Probes, A- 
1351; Eugene, OR) using excess diisopropylcarbodiimide (DIC) in 

10 dimethylformamide (DMF) at room temperature. The fluorescent N-C blocked 
peptide is purified by silica gel chromatography (10% methanol in DCM). The N 
terminal FMOC group is then removed using piperidine (20%) in DMF, and the 
N-free peptide, purified by silica gel chromatography (20% methanol in DCM, 
0.5% HO Ac). Finally, any t-butyl side chain protective groups are removed using 

15 95% trifluoroacetic acid containing 2.5 % water and 2.5 % triisopropyl silane. 
The peptide obtained in such a manner should give a single peak by HPLC and is 
sufficiently pure for carrying on with the assay described below. 

Phosphopeptide Modifications 

20 It is understood that modifications can be made to the amino acid residues 

of the phosphopeptides of the invention, to enhance or prolong the therapeutic 

efficacy and/or bioavailability of the phosphopeptide. Accordingly, ot-amino acids 

having the following general formula (I): 

R 




(I) 
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where R defines the specific amino acid residue, may undergo various 
modifications. Exemplary modifications of ot-amino acids, include, but are not 
limited to, the following formula (II): 

f 

I R 2Q 

R 3 

(II) 

5 R|, R 2 , R3, R4, and R 5 , are independently hydrogen, hydroxy, nitro, halo, C1.5 
branched or linear alkyl, Ci_ 5 alkaryl, heteroaryl, and aryl; wherein the alkyl, 
alkaryl, heteroaryl, and aryl may be unsubstituted or substituted by one or more 
substituents selected from the group consisting of C i_ 5 alkyl, hydroxy, halo, nitro, 
C1.5 alkoxy, C]_ 5 alkylthio, trihalomethyl, Ci_ 5 acyl, arylcarbonyl, 

10 heteroarylcarbonyl, nitrile, C1.5 alkoxycarbonyl, oxo, arylalkyl (wherein the alkyl 
group has from 1 to 5 carbon atoms) and heteroarylalkyl (wherein the alkyl group 
has from 1 to 5carbon atoms); alternatively, R\ and R 2 are joined to form a C 3 _ 8 
cyclic ring, optionally including oxygen, sulfur or hydrogen, or Ci_ 5 alkyl, 
optionally substituted by hydroxyl; or R 2 and R 3 are joined to form a C 3 _8 cyclic 

15 ring, optionally substituted by hydroxyl and optionally including oxygen, sulfur, 
Ci_ 5 aminoalkyl, or C1.5 alkyl. Methods well known in the art for making 
modifications are found, for example, in " Remington: The Science and Practice of 
Pharmacy " (20th ed., ed. A.R. Gennaro, 2000, Lippincott Williams & Wilkins), 
hereby incorporated by reference. 

20 

Assays and high throughput assays 

Fluorescence polarization assays can be used in displacement assays to 
identify small molecule peptidomimetics. The following is an exemplary method 
for use of fluorescence polarization, and should not be viewed as limiting in any 
25 way. For screening, all reagents are diluted at the appropriate concentration and 
the working solution, kept on ice. The working stock concentration for GST and 
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GST fusion proteins are -4 ng/jiL, Fluorescein-labeled phosphopeptides can be 
used at a concentration of 1.56 fmol/^iL, while cold phosphopeptides and peptides 
at 25 pmol/|iL. Samples are incubated at a total volume of 200 (iL per well in 
black flat bottom plates, Biocoat, #359135 low binding (BD Biosciences; 
5 Bedford, MA). Assays are started with the successive addition using a Labsystem 
Multi-Drop 96/384 device (Labsystem; Franklin, MA) of 50|iL test compounds, 
diluted in 10% DMSO (average concentration of 28 jiM), 50^lL of 50 mM MES- 
pH 6.5, 5Q|iL of Fluorescein-phosphopeptide, 50(iL of GST-Plk-1 PBD, 50|iL of 
unlabeled phosphopeptide, or unphosphorylated peptide can be used as a negative 

10 control. Once added, all the plates are placed at 4°C. Following overnight 

incubation at 4°C, the fluorescence polarization is measured using a Polarion plate 
reader (Tecan, Research Triangle Park, NC). A xenon flash lamp equipped with 
an excitation filter of 485 nm and an emission filter of 535 nm. The number of 
flashes is set at 30. Raw data can then be converted into a percentage of total 

15 interaction(s). All further analysis can be performed using SPOTFIRE data 
analysis software (SPOTFIRE, Somerville, MA) 

Upon selection of active compounds, auto-fluorescence of the hits is 
measured as well as the fluorescein quenching effect, where a measurement of 
2000 or more units indicates auto-fluorescence, while a measurement of 50 units 

20 indicates a quenching effect. Confirmed hits can then be analyzed in dose- 
response curves (IC 50 ) for reconfirmation. Best hits in dose-response curves can 
then be assessed by isothermal titration calorimetry using GST-Plk-1 PBD. 

Alternate binding and displacement assays 
25 Fluorescence polarization assays are but one means to measure 

phosphopeptide-protein interactions in a screening strategy. Alternate methods for 
measuring phosphopeptide-protein interactions are known to the skilled artisan. 
Such methods include, but are not limited to mass spectrometry (Nelson and 
Krone, J. MoL Recognit., 12:77-93, 1999), surface plasmon resonance (Spiga et 
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al, FEBS Lett., 51 1:33-35, 2002; Rich and Mizka, J. Mol. Recognit., 14:223-8, 
2001; Abrantes et al, Anal. Chem., 73:2828-35, 2001), fluorescence resonance 
energy transfer (FRET) (Bader et al, J. Biomol. Screen, 6:255-64, 2001; Song et 
al, Anal. Biochem. 291:133-41, 2001; Brockhoff et al., Cytometry, 44:338-48, 
5 2001), bioluminescence resonance energy transfer (BRET) (Angers et al, Proc. 
Natl. Acad. Sci. USA, 97:3684-9, 2000; Xu et al, Proc. Natl Acad. ScL USA, 
96:151-6, 1999), fluorescence quenching (Engelborghs, Spectrochim. Acta A. Mol 
Biomol Spectrosc, 57:2255-70, 70; Geoghegan et al, Bioconjug. Chem. 1 1:71-7, 
2000), fluorescence activated cell scanning/sorting (Barth et al, J. Mol Biol, 
10 301:751-7, 2000), ELISA, and radioimmunoassay (RIA). 

Test extracts and compounds 

In general, peptidomimetic compounds that affect phosphopeptide-protein 
interactions are identified from large libraries of both natural products, synthetic 
15 (or semi-synthetic) extracts or chemical libraries, according to methods known in 
the art. 

Those skilled in the art will understand that the precise source of test 
extracts or compounds is not critical to the screening procedure(s) of the 
invention. Accordingly, virtually any number of chemical extracts or compounds 

20 can be screened using the exemplary methods described herein. Examples of such 
extracts or compounds include, but are not limited to, plant-, fungal-, prokaryotic- 
or animal-based extracts, fermentation broths, and synthetic compounds, as well as 
modifications of existing compounds. Numerous methods are also available for 
generating random or directed synthesis (e.g., semi-synthesis or total synthesis) of 

25 any number of chemical compounds, including, but not limited to, saccharide-, 

lipid-, peptide-, and nucleic acid-based compounds. Synthetic compound libraries 
are commercially available from, for example, Brandon Associates (Merrimack, 
NH) and Aldrich Chemical (Milwaukee, WI) 
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Alternatively, libraries of natural compounds in the form of bacterial, 
fungal, plant, and animal extracts are commercially available from a number of 
sources, including, but not limited to, Biotics (Sussex, UK), Xenova (Slough, UK), 
Harbor Branch Oceangraphics Institute (Ft. Pierce, FL), and PharmaMar, U.S.A. 
5 (Cambridge, MA). In addition, natural and synthetically produced libraries are 
produced, if desired, according to methods known in the art (e.g., by combinatorial 
chemistry methods or standard extraction and fractionation methods). 
Furthermore, if desired, any library or compound may be readily modified using 
standard chemical, physical, or biochemical methods. 

10 

Administration of phosphopeptides, and peptidomimetic small molecules 
By selectively disrupting or preventing a phosphoprotein from binding to 
its natural partner(s) through its binding site, the phosphopeptides of the invention, 
or derivatives, or peptidomimetics thereof, can significantly alter the biological 

15 activity or the biological function of a polo-like kinase. Therefore, the 

phosphopeptides, or derivatives thereof, of the invention can be used for the 
treatment of a disease or disorder characterized by inappropriate cell cycle 
regulation or apoptosis. 

Diseases or disorders characterized by inappropriate cell cycle regulation, 

20 include hyperproliferative disorders, such as neoplasias. Examples of neoplasms 
include, without limitation, leukemias (e.g., acute leukemia, acute lymphocytic 
leukemia, acute myelocytic leukemia, acute myeloblasts leukemia, acute 
promyelocytic leukemia, acute myelomonocytic leukemia, acute monocytic 
leukemia, acute erythroleukemia, chronic leukemia, chronic myelocytic leukemia, 

25 chronic lymphocytic leukemia), polycythemia vera, lymphoma (Hodgkin's 

disease, non-Hodgkin's disease), Waldenstrom's macroglobulinemia, heavy chain 
disease, and solid tumors such as sarcomas and carcinomas (e.g., fibrosarcoma, 
myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma, 
angiosarcoma, endotheliosarcoma, lymphangiosarcoma, 
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lymphangioendotheliosarcoma, synovioma, mesothelioma, Ewing's tumor, 
leiomyosarcoma, rhabdomyosarcoma, colon carcinoma, pancreatic cancer, breast 
cancer, ovarian cancer, prostate cancer, squamous cell carcinoma, basal cell 
carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland carcinoma, 
5 papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, medullary 
carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct 
carcinoma, choriocarcinoma, seminoma, embryonal carcinoma, Wilm's tumor, 
cervical cancer, uterine cancer, testicular cancer, lung carcinoma, small cell lung 
carcinoma, bladder carcinoma, epithelial carcinoma, glioma, astrocytoma, 

10 medulloblastoma, craniopharyngioma, ependymoma, pinealoma, 

hemangioblastoma, acoustic neuroma, oligodenriglioma, schwannoma, 
meningioma, melanoma, neuroblastoma, and retinoblastoma). 

Cells undergoing inappropriate apoptosis include neurons in a patient who 
has a neurodegenerative disease (e.g., Parkinson's disease, Alzheimer's disease, or 

15 stroke), and cardiomyocytes (e.g., after myocardial infarction or over the course of 
congestive heart failure). Compositions of the invention, i.e., inhibitors of Plk-3, 
may be useful in treating a cell undergoing inappropriate apoptosis. 

A Plk-1 PBD-binding phosphopeptide or peptidomimetic small molecule 
may be administered within a pharmaceutically-acceptable diluent, carrier, or 

20 excipient, in unit dosage form. Conventional pharmaceutical practice may be 
employed to provide suitable formulations or compositions to administer the 
compounds to patients suffering from a disease that is caused by excessive cell 
proliferation. Administration may begin before the patient is symptomatic. Any 
appropriate route of administration may be employed, for example, administration 

25 may be parenteral, intravenous, intra-arterial, subcutaneous, intramuscular, 

intracranial, intraorbital, ophthalmic, intraventricular, intracapsular, intraspinal, 
intracisternal, intraperitoneal, intranasal, aerosol, suppository, or oral 
administration. For example, therapeutic formulations may be in the form of 
liquid solutions or suspensions; for oral administration, formulations may be in the 
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form of tablets or capsules; and for intranasal formulations, in the form of 
powders, nasal drops, or aerosols. 

The pharmaceutical compositions of the present invention are prepared in a 
manner known per se 5 for example by means of conventional dissolving, 
5 lyophilising, mixing, granulating or confectioning processes. Methods well 

known in the art for making formulations are found, for example, in " Remington: 
The Science and Practice of Pharmacy " (20th ed., ed. A.R. Gennaro, 2000, 
Lippincott Williams & Wilkins, Philadelphia). 

Solutions of the active ingredient, and also suspensions, and especially 

10 isotonic aqueous solutions or suspensions, are preferably used, it being possible, 
for example in the case of lyophilized compositions that comprise the active 
ingredient alone or together with a carrier, for example mannitol, for such 
solutions or suspensions to be produced prior to use. The pharmaceutical 
compositions may be sterilized and/or may comprise excipients, for example 

15 preservatives, stabilisers, wetting and/or emulsifying agents, solubilisers, salts for 
regulating the osmotic pressure and/or buffers, and are prepared in a manner 
known per se, for example by means of conventional dissolving or lyophilising 
processes. The said solutions or suspensions may comprise viscosity-increasing 
substances, such as sodium carboxymethylcellulose, carboxymethylcellulose, 

20 dextran, poly vinylpyrrolidone or gelatin. 

Suspensions in oil comprise as the oil component the vegetable, synthetic 
or semi-synthetic oils customary for injection purposes. There may be mentioned 
as such especially liquid fatty acid esters that contain as the acid component a 
long-chained fatty acid having from 8 to 22, especially from 12 to 22, carbon 

25 atoms, for example lauric acid, tridecylic acid, myristic acid, pentadecylic acid, 
palmitic acid, margaric acid, stearic acid, arachidic acid, behenic acid or 
corresponding unsaturated acids, for example oleic acid, elaidic acid, erucic acid, 
brasidic acid or linoleic acid, if desired with the addition of anti oxidants, for 
example, vitamins E, 0-carotene, or 3,5-di-tert-butyl-4-hydroxytoluene. The 
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alcohol component of those fatty acid esters has a maximum of 6 carbon atoms 
and is a mono- or poly-hydroxy, for example a mono-, di- or tri-hydroxy, alcohol, 
for example methanol, ethanol, propanol, butanol or pentanol or the isomers 
thereof, but especially glycol and glycerol. The following examples of fatty acid 
5 esters are there fore to be mentioned: ethyl oleate, isopropyl myristate, isopropyl 
palmitate, "Labrafil M 2375" (poly oxyethylene glycerol trioleate, Gattefoss, 
Paris), "Miglyol 812" (triglyceride of saturated fatty acids with a chain length of 
to Ci2 3 Huls AG, Germany), but especially vegetable oils, such as cottonseed 
oil, almond oil, olive oil, castor oil, sesame oil, soybean oil and more especially 

10 groundnut oil. 

The injection compositions are prepared in customary manner under sterile 
conditions; the same applies also to introducing the compositions into ampoules or 
vials and sealing the containers. 

Pharmaceutical compositions for oral administration can be obtained by 

15 combining the active ingredient with solid carriers, if desired granulating a 

resulting mixture, and processing the mixture, if desired or necessary, after the 
addition of appropriate excipients, into tablets, drage cores or capsules. It is also 
possible for them to be incorporated into plastics carriers that allow the active 
ingredients to diffuse or be released in measured amounts. 

20 Suitable carriers are especially fillers, such as sugars, for example lactose, 

saccharose, mannitol or sorbitol, cellulose preparations and/or calcium phosphates, 
for example tricalcium phosphate or calcium hydrogen phosphate, and binders, 
such as starch pastes using for example corn, wheat, rice or potato starch, gelatin, 
tragacanth, methylcellulose, hydroxypropylmethylcellulose, sodium 

25 carboxymethylcellulose and/or polyvinylpyrrolidone, and/or, if desired, 

disintegrates, such as the above-mentioned starches, also carboxymethyl starch, 
crosslinked polyvinylpyrrolidone, agar, alginic acid or a salt thereof, such as 
sodium alginate. Excipients are especially flow conditioners and lubricants, for 
example silicic acid, talc, stearic acid or salts thereof, such as magnesium or 
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calcium stearate, and/or polyethylene glycol. Drage cores are provided with 
suitable, optionally enteric, coatings, there being used, inter alia, concentrated 
sugar solutions which may comprise gum arabic, talc, polyvinylpyrrolidone, 
polyethylene glycol and/or titanium dioxide, or coating solutions in suitable 
5 organic solvents, or, for the preparation of enteric coatings, solutions of suitable 
cellulose preparations, such as ethylcellulose phthalate or 

hydroxypropylmethylcellulose phthalate. Capsules are dry-filled capsules made of 
gelatin and soft sealed capsules made of gelatin and a plasticiser, such as glycerol 
or sorbitol. The dry-filled capsules may comprise the active ingredient in the form 

10 of granules, for example with fillers, such as lactose, binders, such as starches, 
and/or glidants, such as talc or magnesium stearate, and if desired with stabilisers. 
In soft capsules the active ingredient is preferably dissolved or suspended in 
suitable oily excipients, such as fatty oils, paraffin oil or liquid polyethylene 
glycols, it being possible also for stabilisers and/or antibacterial agents to be 

1 5 added. Dyes or pigments may be added to the tablets or drage coatings or the 
capsule casings, for example for identification purposes or to indicate different 
doses of active ingredient. 

The pharmaceutical compositions comprise from approximately 1% to 
approximately 95%, preferably from approximately 20% to approximately 90%, 

20 active ingredient. Pharmaceutical compositions according to the invention may 
be, for example, in unit dose form, such as in the form of ampoules, vials, 
suppositories, drages, tablets or capsules. 

The formulations can be administered to human patients in a therapeutically 
effective amount (e.g., an amount that decreases, suppresses, attenuates, 

25 diminishes, arrests, or stabilizes the development or progression of a disease, 
disorder, or infection in a eukaryotic host organism). The preferred dosage of 
therapeutic agent to be administered is likely to depend on such variables as the 
type and extent of the disorder, the overall health status of the particular patient, 
the formulation of the compound excipients, and its route of administration. 
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For any of the methods of application described above, a Plk-1 PBD- 
interacting small molecule may be applied to the site of the needed therapeutic 
event (for example, by injection), or to tissue in the vicinity of the predicted 
therapeutic event or to a blood vessel supplying the cells predicted to require 
5 enhanced therapy. 

The dosages of Plk-1 PBD-interacting small molecule(s) depends on a 
number of factors, including the size and health of the individual patient, but, 
generally, between 0.1 mg and 1000 mg inclusive are administered per day to an 
adult in any pharmaceutically acceptable formulation. In addition, treatment by 
10 any of the approaches described herein may be combined with more traditional 
therapies. 

Combination Therapy 

If desired, treatment with Plk-1 PBD-interacting small molecule may be 

15 combined with more traditional therapies for the proliferative disease such as 
surgery or administration of chemotherapeutics or other anti-cancer agents, 
including, for example, y-radiation, alkylating agents (e.g., nitrogen mustards such 
as cyclophosphamide, ifosfamide, trofosfamide, and chlorambucil; nitrosoureas 
such as carmustine, and lomustine; alkylsulphonates such as bisulfan and 

20 treosulfan; triazenes such as dacarbazine; platinum-containing compounds such as 
cisplatin and carboplatin), plant alkaloids (e.g., vincristine, vinblastine, 
anhydrovinblastine, vindesine, vinorelbine, paclitaxel, and docetaxol), DNA 
topoisomerase inhibitors (e.g., etoposide, teniposide, topotecan, 9- 
aminocamptothecin, (campto) irinotecan, and crisnatol), mytomycins (e.g., 

25 mytomicin C), antifolates (e.g., methotrexate, trimetrexate, mycophenolic acid, 
tiazofurin, ribavirin, EICAR, hydroxyurea, and deferoxamine), uracil analogs (5- 
fluorouracil, floxuridine, doxifluridine, and ratitrexed), cytosine analogs 
(cytarbine, cytosine arabinoside, and fludarabine), purine analogs (e.g., 
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mercaptopurine, and thioguanine), hormonal therapies (e.g., tamoxifen, raloxifene, 
megestrol, goserelin, leuprolide acetate, flutamide, and bicalutamide), vitamin D3 
analogs (EB 1089, CB 1093, and KH 1060), vertoporfin, phthalocyanine, 
photosensitizer Pc4, demethoxy-hypocrellin A, interferon-a, interferon-y, tumor 
5 necrosis factor, lovastatin, l-methyl-4-phenylpyridinium ion, staurosporine, 
actinomycin D, dactinomycin, bleomycin A2, bleomycin B2, adriamycin, 
peplomycin, daunorubican, idarubican, epirubican, pirarubican, zorubican, 
mitoxantrone, and verapamil. 

10 Other Embodiments 

From the foregoing description, it is apparent that variations and 
modifications may be made to the invention described herein to adopt it to various 
usages and conditions. Such embodiments are also within the scope of the 
following claims. 

15 All patents and publications mentioned in this specification are hereby 

incorporated by reference to the same extent as if each independent publication or 
patent application, including 60/426,132, was specifically and individually 
indicated to be incorporated by reference. 

20 What is claimed is: 
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